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MAMMALIAN PROSTAGLANDIN H SYNTHASE* 2 

1. INTRODUCTION 

The present invention relates to the gene encoding the 
5 mammalian prostaglandin H synthase-2, hereinafter "PGHS-2, 1 ' 
and its product. Mammalian prostaglandin H synthase-1, 
hereinafter "PGHS-l," is responsible for the constitutive 
prostaglandin synthesis in mammalian physiology, PGHS-2 was 
discovered to be responsible for the increased prostaglandin 

10 synthesis associated with inflammation. The invention 
relates to PGHS-2 and to compounds which specifically 
modulate the expression of PGHS-2 and not PGHS-1 including 
but not limited to nucleic acid encoding PGHS-2 and 
homologues, analogues, and deletions thereof, as well as 

15 antisense, ribozyme, triple helix, antibody, and polypeptide 
molecules and small inorganic molecules. The invention 
further relates to methods of diagnosing an aberrant PGHS-2 
gene and gene product as well as gene therapies for use as* a 
remedy for such aberrant PGHS-2 gene or gene product. In 

20 addition, the invention relates to pharmaceutical 

formulations and routes of administration for such remedies. 

2. BACKGROUND OF THE INVENTION 

Prostaglandins (which include PGEj, PGD 2 , PGF^, PGI 2 and 
25 other related compounds) represent a diverse group of 

autocrine and paracrine hormones that are derived from the 
metabolism of fatty acids. They belong to a family of 
naturally occurring eicosanoids (prostaglandins, thromboxanes 
and leukotrienes) which are not stored as such in cells, but 
30 are biosynthesized on demand from arachidonic acid, a 20- 

carbon fatty acid that is derived from the breakdown of cell- 
membrane phospholipids. Under normal circumstances, the 
eicosanoids are produced at low levels to serve as important 
mediators of many and diverse cellular functions which can be 
35 very different in different types of cells. However, the 
prostaglandins also play critical roles in pathophysiology. 
In particular, inflammation is both initiated and maintained, 
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at least in part, by the overproduction of prostaglandins in 
injured cells. The central role that prostaglandins play in 
inflammation is underscored by the fact that those aspirin- 
like non-steroidal anti-inflammatory drugs (NSAIDS) that are 
5 most effective in the therapy of many pathological 

inflammatory states all act by inhibiting prostaglandin 
synthesis. Unfortunately, the use of these drugs is often 
limited by the side effects (gastrointestinal bleeding, 
ulcers, renal failure, and others) that result from the 

10 undesirable reduction in prostaglandins in normal cells that 
now suffer from a lack of those autocrine and paracrine 
functions that are required for the maintenance of normal 
physiology. The development of new agents that will act more 
specifically by achieving a reduction in prostaglandins in 

15 inflamed cells without altering prostaglandin production in 
other cells is one of the major goals for future medicinal 
therapy. 

The cyclooxygenase reaction is the first step in the - 
prostaglandin synthetic pathway; an enzyme (PGHS) with 

20 prostaglandin G/H synthetic activity converts arachidonic 
acid into the endoperoxide PGG 2 , which then breaks down to 
PGH 2 (the two reactions are carried out by a single enzyme) . 
PGH 2 is in turn metabolized by one or more prostaglandin 
synthase (PGE 2 synthase, PGD 2 synthase etc.) to generate the 

25 final "2-series" prostaglandins, PGEj, PGD 2 , PGF^, PGI 2 and 
others which include the thromboxanes, TXA 2 . The first step 
(PGHS) is the one that is rate-limiting for prostaglandin 
synthesis. As such, the PGHS-mediated reaction is the 
principal target for anti-inflammatory drug action; and it is 

30 inhibition of PGHS activity that accounts for the activity of 
the NSAIDS (aspirin, acetominophen , ibuprofen, naproxen, 
indomethacin) and others that limit the overproduction of 
prostaglandins in inflammation (the desired therapeutic goal) 
and reduce the normal production of prostaglandins in 

35 uninflamed cells (which produces the undesirable side 
effects) . 
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In addition to the abnormal changes associated with 
inflammation, multiple other factors are known to influence 
prostaglandin production under experimental conditions. 
These include growth factors, cAMP, tumor promoters, src 
5 activation and interleukins 1 and 2, all of which increase 
overall cellular PGHS activity. The adrenal glucocorticoid 
hormones and related synthetic anti- inflammatory steroids 
also inhibit prostaglandin synthesis, but their metabolic 
site of action is not well defined. 
10 Human, ovine, and murine cDNAs have been cloned for 

PGHS-1. All show similar sequences and hybridize with 2.8- 
3.0-kb mRNAs on Northern blots. However, several research 
groups have recently identified and predicted the sequence of 
a protein reported to be related to PGHS-1 in some manner. 
15 m 1990, Han et al., 1990, Proc. Nat'l. Acad. Sci. USA, 

87:3373-3377 reported changes in protein synthesis caused by 
the polypeptide pp60 v,rc , following infection of BALB/c 3T3 
fibroblasts by Rous sarcoma virus temperature-sensitive 
mutant strain LA90. Giant two-dimensional gel 
20 electrophoresis detected induction of a 72-74 kDa protein 
doublet that is recognized by anticyclooxygenase antibodies. 
Synthesis of this doublet was also transiently increased by 
exposure to platelet-derived growth factor and inhibited by 
dexamethasone treatment. These changes in protein synthesis 
25 were strongly correlated with changes in cyclooxygenase 
activity. The protein doublet was also seen in mouse C127 
fibroblasts where its synthesis was found to be regulated by 
serum and dexamethasone and correlated with cyclooxygenase 
activity. See O'Banion et al., 1991, J. Biol. Chem., 
30 266:23261-23267. 

Xie et al., 1991, Proc. Nat 1 !. Acad. Sci. USA, 2692-2696 
followed Han's et al. earlier report with the isolation of a 
set of cDNAs corresponding to pp60 v_tre inducible form 
"miPGHS^" , for mitogen- inducible PGHS^^. Although Xie et 
35 a i. speculated that prostaglandin synthesis may play a role 
in src product-mediated cellular transformation, their 
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experiments did not permit them to discriminate between 
miPGHS^ as a second cyclooxygenase or simply as the chicken 
homolog of sheep PGHS-1, "PGHS W ". 

In a separate set of experiments/ Kujubu et al., 1991 
5 J. Biol. Chem. f 266:12866-12872 reported that one of the 
primary response genes cloned from mitogen-responding Swiss 
3T3 cells (TIS10) has a long 3 1 -untranslated region and 
encodes a "predicted" 66 kDa protein which is about 60% 
identical to mouse PGHS-1. The sequence of this putative 

10 protein was essentially identical to that derived by Xie et 
al. On the basis of sequence similarities, Kujubu et al. 
speculated that the enzymatic activity of the protein encoded 
by the TIS10 gene would be likely to be "similar" to 
enzymatic activity of other types of mammalian PGHS-l. They 

15 concluded, that "[p]roof of this conjecture, however, awaits 
the heterologous expression of this gene production from an 
expressible plasmid and the direct measurement of 
cyclooxygenase activity in transfected cells and/or purified 
preparations of the TIS10 protein." 

20 There is increasing emphasis on the development of 

methods for the modulation and evaluation of the activity of 
the prostaglandin synthetic pathway. As noted above, 
nonsteroidal anti-inflammatory agents, such as aspirin and 
indomethacin, inhibit the cyclooxygenase which converts 

25 arachidonic acid into PGG 2 and PGH 2 , Therefore, there is a 
need for improved methods to study the effectiveness of 
existing anti-inflammatory drugs and to evaluate the 
effectiveness of potential anti-inflammatory agents, at the 
molecular level, as well as for reagents for use in such 

30 methods. 

3. SUMMARY OF THE INVENTION 

The invention relates to the gene encoding the mammalian 
prostaglandin H synthase-2 and its product. The invention is 
35 based, in part, on the discovery that there are two PGHS 

genes; one constituitively expressed and termed herein PGHS- 
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1, and a second which is responsive to regulatory control and 
termed herein PGHS-2. More specifically, the invention 
relates to the diagnosis of an aberrant PGHS-2 gene or gene 
product; the identification, production, and use of compounds 
5 which modulate PGHS-2 gene expression or the activity of the 
PGHS-2 gene product including but not limited to nucleic acid 
encoding PGHS-2 and homologues, analogues, and deletions 
thereof, as well as antisense, ribozyme, triple helix, 
antibody, and polypeptide molecules and small inorganic 

10 molecules; and pharmaceutical formulations and routes of 
administration for such compounds. The invention also 
relates to the identification of naturally occurring cells 
and the creation of cells that express PGHS-1 or PGHS-2 
exclusively and the use of such cells in drug screening. 

15 m the examples described infra, it is shown that a 

second PGHS gene, PGHS-2, has been identified in mouse and in 
human cells which is distinct from the PGHS-1 gene. It is 
further shown that PGHS-2 expression is responsive to 
regulatory control while PGHS-1 expression is constitutive. 

20 An assay employing PGHS-2 transf ectants was used to 

successfully identify compounds which modulate the expression 
of the PGHS-2 gene. Assays for the activity of the PGHS-2 
gene product are also described. In addition assays 
employing PGHS-2 and PGHS-1 transf ectants are described for 

25 use in identifying compounds which modulate the expression of 
the PGHS-2 gene and not the PGHS-1 gene. 

3.1. DEFINITIONS 

As used herein, the following terms and abbreviations 
30 shall have the meanings indicated below: 



base pair(s) bp 

complementary DNA cDNA 

counts per minute cpm 

deoxyribonucleic acid DNA 
kilobase pairs 

kilodation kDa 
micrograms 

micrometer J™ 
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3.1. DEFINITIONS 

As used herein, the following terms and abbreviations 
30 shall have the meanings indicated below: 



base pair(s) 
complementary DNA 
counts per minute 
deoxyribonucleic acid 
kilobase pairs 
kilodation 
micrograms 
micrometer 



-bp 



35 



CDNA 

cpm 

DNA 

kb 

kDa 

Mg 
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nanograms ng 

nanometer nm 

nucleotide nt 

polyacrylamide gel electrophoresis PAGE 

polymerase chain reaction PGR 

prostaglandin H synthase PGHS 

5 radioimmunoassay RIA 

ribonucleic acid RNA 

sodium dodecyl sulfate SDS 

units u 



10 As used herein, the word "modulate" shall have its usual 

meaning, but shall also encompass the meanings of the words 
enhance, inhibit, and mimic. In addition, as used herein, 
the word "expression" when used in connection with a gene 
such as PGHS-2 shall have its usual meaning, but shall also 

15 encompass the transcription of the gene, the longevity of 

functional mRNA transcribed from the gene, the translation of 
that mRNA, and the activity of the gene product. 

4. DESCRIPTION OF THE DRAWINGS 

20 FIG. 1 depicts the cDNA (SEQ ID NO:l) and predicted 

amino acid sequence (SEQ ID NO: 2) of murine griPGHS ("PGHS- 
2"). The standard one letter code for amino acids is used. 
Based on a transcription start site determined by primer 
extension at -24, the numbering of this sequence starts at 

25 25. A predicted signal peptide cleavage site between amino 
acids 17 and 18 is marked with an arrowhead. The position of 
the putative aspirin-modified serine is indicated by a 
circle, and potential N-glycosylation sites are double 
underlined. 

30 FIG* 2 is a schematic depiction comparing the cDNA and 

protein sequences for the murine 2.8- and 4.1 kb RNA-encoded 
cyclooxygenases. cDNA structures for the 4.1 kb cDNA cloned 
from C127 cells and the murine 2.8 kb cDNA are drawn as the 
thick lines at top and bottom. The numbering of the 4.1 kb 

35 cDNA is based on primer extension data. Since the 5 1 end of 
the 2.8 kb mouse mRNA has not been determined, no numbers 
have been assigned to the translation start and stop sites. 
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Alternative polyadenylation sites established from other cDNA 
clones are indicated with "A" and the S^AUUl^A-a 1 motifs are 
identified by dots underneath the sequence. These motifs are 
not found in the 2.8 kb cDNA. Deduced protein sequences are 
5 drawn colinearly with gaps (17 aa at the amino-terminal end 
of the 4.1 kb mRNA product, and 18 aa at the carboxy-terminal 
end of the 2.8 kb mRNA product) indicated by connecting 
lines. The 26 aa leader sequence for the 2.8 kb PGHS is 
indicated. Although its extent has not been precisely 

10 defined, a shorter, nonhomologous leader appears to exist for 
griPGHS with a mature N-terminal end at amino acid 18. The 
positions of potential N-glycosylation sites (NXS/T, "N") and 
the conserved aspirin modified serines are noted on each 
molecule. The hatched areas near the center of each molecule 

15 denote presumed axial (TIWLREHNRV, identical between the two 
molecules) and distal ( KALGH / RGLGH ) heme-binding sites as 
suggested by DeWitt et al., 1990, J. Biol. Chem. 265:5192- 
5198. Interestingly, the RGLGH sequence in griPGHS fits the 
consensus RXXXH distal heme-binding site described for other 

20 peroxidases, Kimura and Ikeda-Saito, 1988, Prot. Struc. Func. 
Genetics 3, 113-120, and supports the previous suggestion 
that KALGH serves the same purpose in the 2.8 kb gene 
product, DeWitt et al., 1990, J. Biol. Chem. 265-5192-5198. 
The bar at the bottom of the figure represents the 

25 similarities between the two mouse PGHS proteins (omitting 
the nonconserved N— and C-termini) as the percentage of 
identical residues for groups of 20 amino acids with 
increasing shading indicating 40-55% (no shading), 60-75%, 
80-95%, and 100% identity. The overall identity is 64% and 

30 with conservative changes the similarity index is 79%. 
FIGS. 3A-3B are a photographic depiction of 
autoradiographies obtained by Northern blotting, monitoring 
the expression of the genes encoding griPGHS and the 
constitutive PGHS-1, as expressed in human monocytes, in 

35 response to inter leukin-1 treatment, a known mediator of 

inflammation. Adherent human monocytes isolated from healthy 
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donors were suspended in medium without serum at 1 x 10 6 
cells/ml. One ml aliquots in 5 ml polypropylene tubes were 
incubated with loosened caps in 5% C0 2 at 37 °C with occasional 
shaking. Figures 3A-3B are more fully described as follows: 
5 FIG. 3A: Monocytes were incubated for 4 h in the 

presence or absence of dexamethasone (ImM) prior to 
total RNA isolation. Five jig was subjected to 
Northern blot analysis with the indicated probes. 
FIG. 3B: Monocytes were treated with dexamethasone 
10 (1/zM), 1L-10 (10 half-maximal units, Collaborative 

Research) , or both for the indicated times prior to 
RNA isolation. Cycloheximide (25 /iM) was added to 
one set of incubations 15 min prior to he addition 
of cytokine or hormone. 
15 FIG. 4 is a schematic depiction of griPGHS expression 

vector construction. griPGHS was prepared for directional 
subcloning into the pRC/CMV expression vector (Invitrogen) by 
digestion with Acc I, Klenow fill-in, and digestion with Not 
I. This fragment, extending from the Not I site 50 bases 
20 upstream of the cDNA end to nt 1947 of the cDNA, contains the 
full-coding region truncated immediately before any 5'-AUUUA- 
3 1 mRNA destabilizing regions, O'Banion et al., 1992, Proc. 
Nat*l. Acad. Sci. USA, 89:4888-4892. The pRc/CMV vector DNA 
was digested with Xba I, filled in with Klenow, then digested 
25 with Not I. The dots in the 3' untranslated region of 
griPGHS indicate the locations of 5 1 -AUUUA 1 -3 • mRNA 
destabilizing sequences. ,f A" represents alternative 
polyadenylation sites, "N" represents potential glycosylation 
sites, and "SER 516" marks the location of the aspirin- 
30 acetylated serine. 

FIGS. 5A-5D are a graphic depiction of the inhibition of 
murine griPGHS activity in stable transfected mammalian cell 
lines by preselected amounts of several non-steroidal anti- 
inflammatory drugs. Figures 5A-5D are more fully described 
35 as follows: 

FIG. 5A: Acetominophen. 



- 8 - 



WO 96/40720 PCT/US96/08311 

donors were suspended in medium without serum at 1 x 10 6 
cells/ml. One ml aliquots in 5 ml polypropylene tubes were 
incubated with loosened caps in 5% C0 2 at 37 °C with occasional 
shaking. Figures 3A-3B are more fully described as follows: 
5 FIG. 3A: Monocytes were incubated for 4 h in the 

presence or absence of dexamethasone (1/iM) prior to 
total RNA isolation. Five pg was subjected to 
Northern blot analysis with the indicated probes. 
FIG. 3B: Monocytes were treated with dexamethasone 
10 (1/*M), 1L-1/J (10 half-maximal units, Collaborative 

Research) , or both for the indicated times prior to 
RNA isolation. Cycloheximide (25 /itM) was added to 
one set of incubations 15 min prior to he addition 
of cytokine or hormone. 
15 FIG. 4 is a schematic depiction of griPGHS expression 

vector construction. griPGHS was prepared for directional 
subcloning into the pRC/CMV expression vector (Invitrogen) by 
digestion with Acc I, Klenow fill-in, and digestion with Not 
I. This fragment, extending from the Not I site 50 bases 
20 upstream of the cDNA end to nt 1947 of the cDNA, contains the 
full-coding region truncated immediately before any 5'-AUUUA- 
3* mRNA destabilizing regions, O'Banion et al., 1992, Proc. 
Nat'l. Acad. Sci. USA, 89:4888-4892. The pRc/CMV vector DNA 
was digested with Xba I, filled in with Klenow, then digested 
25 with Not I. The dots in the 3' untranslated region of 
griPGHS indicate the locations of 5 1 -AUUUA 1 -3 1 mRNA 
destabilizing sequences. "A" represents alternative 
polyadenylation sites, M N M represents potential glycosylation 
sites, and "SER 516" marks the location of the aspirin- 
30 acetylated serine. 

FIGS. 5A-5D are a graphic depiction of the inhibition of 
murine griPGHS activity in stable transf ected mammalian cell 
lines by preselected amounts of several non-steroidal anti- 
inflammatory drugs. Figures 5A-5D are more fully described 
35 as follows: 

FIG . 5A : Acetominophen . 



WO 96/40720 



PCT/US96/08311 



FIG* 5B: Ibuprofen. 
FIG. 5C: Naproxen. 
FIG. 5D: Indomethacin. 
FIGS. 6A-6B depict the nucleotide sequence of the human 
5 PGHS-2 gene (SEQ ID NO: 3). Figures 6A-6B are more fully 
described as follows: 

FIG. 6A: Nucleotides 90-1049. 
FIG. 6B: Nucleotides 1050-1923. 
FIG. 7 depicts a comparison between the amino acid 
10 sequence of human PGHS-2 of the present invention (upper 

sequence) (SEQ ID NO: 4) and the amino acid sequence published 
by Hla et al. (lower sequence) (SEQ ID NO: 5). The sequences 
are given in standard single letter code. 

FIGS. 8A-8D are a graphical depiction of the inhibition 
15 of human PGHS-2 activity in stably transformed COS cells by 
four non-steroidal anti-inflammatory drugs (NSAID) : 
Acetominophen; Ibuprofen; Naproxen; and Indomethacin. 
Figures 8A-8D are more fully described as follows: 
FIG. 8A: Acetominophen. 
20 FIG. 8B: Ibuprofen. 

FIG. 8C: Naproxen. 
FIG. 8D: Indomethacin. 
FIGS. 9A-9D are a graphical depiction of the inhibition 
of human PGHS-1 activity in stably transformed COS cells by 
25 four NSAID: Acetominophen; Ibuprofen; Naproxen; and 

Indomethacin. Figures 9A-9D are more fully described as 
follows: 

FIG. 9A: Acetominophen. 
FIG. 9B: Ibuprofen. 
30 FIG. 9C: Naproxen. 

FIG. 9D: Indomethacin. 
FIGS. 10A-10D show a nucleic acid sequence comparison 
between the coding regions of human PGHS-2 and PGHS-1. Solid- 
lined-boxes indicate regions where the sequence of PGHS-2 is 
35 least homologous to that of PGHS-1. Dashed-lined-boxes 
indicate regions where the sequence of PGHS-2 is most 
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homologous to that of PGHS-1. Figures 10A-10D are more fully 
described as follows: 

FIG. XOA: PGHS-2 nucleotides 1-467. 
FIG. 10B: PGHS-2 nucleotides 469-1004. 
5 FIG. 10C: PGHS-2 nucleotides 1006-1537. 

FIG. 10D: PGHS-2 nucleotides 1540-1834. 
FIGS. 11A-11C show the nucleic acid sequence of the 5' 
promoter region of human PGHS-2 as compared with that of 
PGHS-1. Dashed-lined-boxes indicate the regions where the 
10 sequence of the PGHS-2 5' region is most homologous to that 
of PGHS-1. Figures 11A-11C are more fully described as 
follows: 

FIG. 11A: PGHS-2 promoter nucleotides 1-950. 
FIG. 11B: PGHS-2 promoter nucleotides 951-1900. 
15 -FIG. 11C: PGHS-2 promoter nucleotides 1901-2400. 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a mammalian cell line 
which contains a chromosomally integrated, recombinant DNA 
20 sequence, which DNA sequence expresses mammalian, preferably 
human, glucocorticoid-regulated inflammatory PGHS, and which 
cell line does not significantly express autologous PGHS-1 or 
PGHS-2 activity. For brevity, glucocorticoid-regulated 
inflammatory PGHS will hereinafter be referred to as 
25 "griPGHS" or "PGHS-2", and the art-recognized mammalian PGHS 
encoded by the 2.8-3.0 kb mRNA (EC 1.14.99.1) will be 
referred to as -constitutive cyclooxygenase," or 
"constitutive PGHS," or "PGHS-1." The recitation that there 
is no "autologous PGHS-1 or PGHS-2 activity" relates to the 
30 inability of the cell line to express PGHS activity apart 
from that expressed by the recombinant DNA sequence. 
Autologous PGHS activity may also be referred to as 
"endogenous" PGHS activity in the art. 

This invention is a result, in part, of the discovery 
35 that the 72-74 kDa cyclooxygenase reported by Han et al., the 
miPGHS* reported by Xie et al., and the TIS10 protein 
reported by Kujubu et al. are essentially identical and 
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represent a second cyclooxygenase, which second form is the 
primary target for inhibition by glucocorticoids and is also 
a target for inhibition by non-steroidal anti-inflammatory 
agents . 

5 The synthesis of a 70 kilodalton (kDa) protein in C127 

mouse fibroblasts, via a mouse 4 kilobase (kb) mRNA, and the 
derived amino acid sequence was reported. The protein 
encoded by the 4-kb mRNA shows 80% amino acid identify with 
the previously known mouse PGHS-1 protein product in a 

10 sequenced 240 base region. See ©•Banion et al., 1991, J. 
Biol. Chem., 35:23261-23267. 

The 70 kDa protein, designated griPGHS or PGHS-2 herein, 
was determined to be a discrete form of cyclooxygenase by 
several assays. The protein was precipitated by anti-PGHS 

15 serum, its synthesis and concomitant cyclooxygenase levels 
are rapidly induced by serum, and the induction is inhibited 
by dexamethasone. The regulation of PGHS-2 synthesis was 
found not to arise from alterations in the level of the 2.8- 
kb PGHS-1 mRNA, but resulted from changes in the level of a 

20 4-kb mRNA species. This latter species is barely detectable 
with a 2.8-kb PGHS-1 DNA probes in cells treated with serum, 
but accumulates to significant levels in cells treated with 
cycloheximide or calcium ionophores. In contrast, there was 
no change in the level of, the 2.8-kb mRNA which encodes 

25 PGHS-1 or "constitutive PGHS" as observed following treatment 
with serum, dexamethasone or cycloheximide. Finally, by 
hybridization analysis, it was shown that the 4-Kb mRNA 
represented the product of a gene that is distinct from the 
gene giving rise to the 2. 8 -Kb mRNA. 

30 These observations indicated that there are two 

cyclooxygenase genes; one constitutively expressed as a 2.8- 
kb mRNA, and a second giving rise to a growth factor and 
glucocorticoid-regulated 4-kb mRNA which encodes PGHS-2. It 
is believed that expression of the latter 4-kb RNA and 
35 concomitantly increased PGHS-2 levels are primarily, if not 
entirely, responsible for the enhanced prostaglandin 
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synthesis that is responsible, directly or indirectly, for 
many of the adverse effects of inflammation. 

The primary and perhaps sole action of most non- 
steroidal anti-inflammatory agents is to inhibit the enzyme 
5 prostaglandin G/H synthase, also known as cyclooxygenase, 
which serves as the first committed step in the biosynthesis 
of prostaglandins. PGHS-2 is a unique isoform of 
cyclooxygenase, which in contrast to the previously cloned, 
constitutively expressed enzyme, is driamatically up-regulated 

10 by growth factors * tissue injury, and proinflammatory 

cytokines, and down-regulated by glucocorticoids (O'Banion et 
al., 1991, J. Biol. Chem. , 266:23261-23267; O'Banion et al., 
1992, Proc. Nat 1 !. Acad. Sci. USA, 89:4888-4892: Pritchard et 
al., 1994, J. Biol. Chem., 269:8504-8509). Recent studies 

15 utilizing specific pharmacological inhibitors of PGHS-2 

confirm that it plays a major role in peripheral inflammation 
(Futaki et al., 1993, J. Pharm. Pharmacol., 45:753-755; 
Masferrer et al., 1994, Proc. Natl. Acad. Sci. USA, 91: 3228- 
3232; Vane et al., 1994, Proc. Nat'l. Acad. Sci. USA, 

20 91:2046-2050) . 

The present invention also comprises an isolated DNA 
sequence (gene) encoding biologically active human PGHS-2; 
antisense and ribozyme molecules specific for the PGHS-2 
transcript; polynucleotide molecules which form a triple 

25 helix at the 5» region of the PGHS-2 gene and thereby prevent 
or reduce transcription of the gene; the isolated, 
essentially pure human PGHS-2 gene product; antibodies to the 
gene product; continuous cell lines engineered to stably 
express PGHS-2; assays for screening compounds, including 

30 peptides, polynucleotides, and small organic molecules to 

identify those that inhibit the expression or activity of the 
PGHS-2 gene product; and methods of treating diseases 
characterized by aberrant PGHS-2 activity using such 
compounds • 

35 

5.1. DNA ENCODING MAMMALIAN PGHS-2 
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essentially pure human PGHS-2 gene product; antibodies to the 
gene product; continuous cell lines engineered to stably 
express PGHS-2; assays for screening compounds, including 

30 peptides, polynucleotides, and small organic molecules to 

identify those that inhibit the expression or activity of the 
PGHS-2 gene product; and methods of treating diseases 
characterized by aberrant PGHS-2 activity using such 
compounds . 

35 

5.1. DNA ENCODING MAMMALIAN PGHS-2 
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The screening of a murine cDNA library enriched in the 4 
kb mRNA of O f Banion et al., 1991, J • Biol. Chem., 35:2326- 
23267 with a radiolabeled portion of the 2.8 kb PGHS cDNA 
revealed a 4.1 kb sequence (Fig. 1). Comparison of the 4.1 

5 kb sequence with that of the previously cloned mouse 2.8 kb 
PGHS cDNA revealed a single open reading frame with 64% amino 
acid identity to the protein encoded by the 2.8 kb PGHS cDNA, 
©•Banion et al., 1992, Proc. Nat»l. Acad. Sci. USA, 89:4888- 
4892. This 4.1 kb sequence is designated PGHS-2, and the 2.8 

10 kb sequence is designated PGHS-1. The reduced amino acid 
sequences are colinear except that PGHS-2 has a shorter 
amino-terminus and longer carboxy-terminus than PGHS-1. 

Three of four potential N-glycosylation sites are 
conserved between the two molecules and there is particularly 

15 high similarity in the regions surrounding a putative axial 
heme-binding domain (amino acids 273-342) and the region 
around the presumed aspirin modif ied-serine 516 (amino acids 
504-550) . By far the largest difference in the two cDNAs is 
the presence of a 2.1 kb 3 1 untranslated region in the 4.1 kb 

20 cDNA. This region is rich in S'-AUUUA-S 1 motifs that are 
associated with the decreased stability of many cytokine and 
protooncogene mRNAs. The presence of these motifs is 
consistent with the profound superinducibility of the 4.1 kb 
mRNA by cycloheximide, which is not observed for the 2.8 kb 

25 mRNA. 

Figure 2 schematically compares cDNA and protein 
sequences for the murine 2.8 and 4.1 kb mRNA-encoded 
cyclooxygenases. cDNA structures for the 4.1 kb cDNA cloned 
from murine C127 cells and the murine 2.8 kb cDNA (DeWitt et 

30 al., 1990, J. Biol. Chem., 265:5192-5198 are drawn as the 
thick lines at top and bottom. The numbering of the 4.1 kb 
cDNA is based on primer extension data. Since the 5 1 end of 
the 2.8 kb mouse mRNA has not been determined, no numbers 
have been assigned to the translation start and stop sites. 

35 Alternative polyadenylation sites established from other cDNA 
clones are indicated with "A" and the 5'-AUUUnA-3« motifs are 
identified by dots underneath the sequence. These motifs are 

- 13 - 
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not found in the 2.8 kb cDNA. Deduced protein sequences are 
drawn colinearly with gaps (17 aa at the amino-terroinal end 
of the 4.1 kb mRNA product , and 18 aa at the carboxy-terminal 
end of the 2.8 kb mRNA product) indicated by connected lines. 
5 The 26 amino acid (aa) leader sequence for the 2.8 kb PGHS is 
indicated. Although its extent has not been precisely 
defined, a shorter, nonhomologous leader appears to exist for 
griPGHS with a mature N-terminal end at amino acid 18. The 
positions of potential N-glycosylation sites (NXS/T, "N") and 

10 the conserved aspirin modified serines are noted on each 

molecule. The hatched areas near the center of each molecule 
denote presumed axial (TIWLREHNRV (SEQ ID NO:7) , identical 
between the two molecules) and distal (KALGH (SEQ ID NO: 8) / 
RGLGH (SEQ ID NO: 9)) heme-binding sites as suggested by 

15 DeWitt et al., cited above. The bar in the middle of the 

figure represents the similarities between the two mouse PGHS 
proteins (omitting the nonconserved N- and C-termini) as the 
percentage of identical residues for groups of 20 amino acids 
with increasing shading indicating 40-55% (no shading) , 60- 

20 75%, 80-95%, and 100% identity. The overall identity is 64% 
and with conservative changes the similarity index is 79%. 

Another specific embodiment of the invention is the 
human PGHS-2 gene and its product. The human PGHS-2 sequence 
differs from the human PGHS-2 sequence disclosed by Hla & 

25 Neilson, 1992, Proc. Nat'l. Acad. Sci. USA, 89:7384-7388, due 
to a glutamic acid (E) rather than a glycine (w) at amino 
acid position 165 of the PGHS-2 gene product (Figure 7) . The 
sequence for the PGHS-2 gene was confirmed by establishing 
the identity of the sequences of two other hPGHS-2 clones 

30 obtained from separate PCR runs, which demonstrates that the 
difference observed is not a PCR artifact. Furthermore, as 
shown in Figure 1, mouse PGHS-2 also has a glutamic acid at 
this position. While the human PGHS-2 nucleotide sequence is 
similar to that of the mouse, there are regions of 

35 substantial divergence. These divergent regions in the 
nucleotide sequence of the human PGHS-2 (FIGS. 6A-6B) 
include, but are not limited to: 

- 14 - 
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TCCACCCGCAGTACAGAAAGTATCACAGGCT 
1345 1405 

GTGTTCCAGATCCAGAGCTCATTAAAACAGT 
1797 1827 

5 PGHS-1 clones were similarly screened and the sequences of 

the PGHS-1 gene and enzyme confirmed to be identical to that 

shown in Figure 2 (SEQ ID NO: 6) in Yokahama and Tanabe, 1984 

Biochem. Biophys. Res. Commun., 165:888-894; see also, Hla, 

1986, Prostaglandins, 32:829-845. 

10 Fragments of the PGHS-2 DNA are also included within the 

scope of the invention. In a further embodiment of the 
invention, the PGHS-2 DNA or a modified sequence thereof may 
be ligated to a heterologous sequence to encode a fusion 
protein. For example, for screening peptide libraries it may 

1S be useful to encode a chimeric PGHS-2 protein expressing a 
heterologous epitope that is recognized by a commercially 
available antibody. A fusion protein may also be engineered 
to contain a cleavage site located between the PGHS-2 
sequence and the heterologous protein sequence, so that the 

20 PGHS-2 protein or protein fragment can be cleaved away from 
the heterologous moiety. In another embodiment, DNA 
sequences encoding a fusion protein comprising all or a 
portion of the PGHS-2 protein fused to another protein with a 
desired activity are within the scope of the invention; e.g., 

2 5 enzymes such as GUS (/3-glucuronidase) , /J-galactosidase, 
lucif erase, etc. 

In another embodiment, DNAs that encode mutant forms of 
PGHS-2 are also included within the scope of the invention. 
Such mutant PGHS-2 DNA sequences encompass deletions, 

30 additions and/or substitutions of nucleotide residues, or of 
regions coding for domains within the PGHS-2 protein. These 
mutated PGHS-2 DNAs may encode gene products that are 
functionally equivalent or which display properties very 
different from the native forms of PGHS-2. 

35 The invention contemplates, in addition to the DNA 

sequences disclosed herein, 1) any DNA sequence that encodes 
the same amino acid sequence as encoded by the DNA sequences 
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shown in Figures 1 and 6A-6B; 2) any DNA sequence that 
hybridizes to the complement of the coding sequences 
disclosed herein (see Figs, 1 and 6A-6B) under highly 
stringent conditions, e.g., washing in 0.1xSSC/0.1% SDS at 
5 68°C (Ausubel, et al., eds., 1989, Current Protocols in 

Molecular Biology, Vol. I, Green Publishing Associates, Inc., 
and John Wiley & sons, Inc., New York, at; p. 2.10.3) and 
encodes a functionally equivalent gene product; and/or 3) any 
DNA sequence that hybridizes to the complement of the coding 

10 sequences disclosed herein (see Figs. 1 and 6) under less 
stringent conditions, such as moderately stringent 
conditions, e.g., washing in 0.2xSSC/0.1% SDS at 42°C 
(Ausubel, et al., 1989, supra), yet which still encodes a 
functionally equivalent gene product. 

15 The invention also encompasses 1) DNA vectors that 

contain any of the coding sequences disclosed herein (see 
Figs. 1 and 6), and/or their complements (i.e., antisense) ; 
2) DNA expression vectors that contain any of the coding 
sequences disclosed herein (see Figs. 1 and 6) , and/or their 

20 complements (i.e., antisense), operatively associated with a 
regulatory element that directs the expression of the coding 
and/or antisense sequences; and 3) genetically engineered 
host cells that contain any of the coding sequences disclosed 
herein (see Figs. 1 and 6), and/or their complements (i.e., 

25 antisense) , operatively associated with a regulatory element 
that directs the expression of the coding and/or antisense 
sequences in the host cell. Regulatory element includes but 
is not limited to inducible and non-inducible promoters, 
enhancers, operators and other elements known to those 

30 skilled in the art that drive and regulate expression. The 
invention includes fragments of any of the DNA sequences 
disclosed herein. 

PGHS-2 sequence can be obtained from a variety of 
sources including cDNA libraries. For example, appropriate 

35 cDNA libraries which are good sources of PGHS-2 can be 
obtained from (Clonetech (Palo Alto, CA) , Stratagene (La 
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Jolla, CA) the ATCC Repository (Rockville, KD) . In addition, 
cDNA libraries may be prepared from mRNA pools collected from 
mammalian cells which express PGHS-2 either constitutively or 
inducibly. By way of example but not by way of limitation, 
5 such cells include C127 mouse fibroblasts and W138 human 
fibroblasts. The collection of mRNA pools and construction 
of cDNA libraries from these cells are set forth more fully 
in the examples described infra. 

Any of the cDNA libraries described above may be 

10 screened by hybridization or. PGR using the PGHS-2 sequences 
described herein as oligonucleotide probes. Screening can be 
performed using those portions of the PGHS-2 sequence which 
are not in PGHS-1, see Figs. 10A-10D. These sequences 
include the following regions in the nucleotide sequence of 

15 PGHS-2: 

171-254 
299-340 
486-512 
602-623 

20 1214-1250 

1283-1346 
1521-1580 
1718-1834 

In addition to cDNA libraries, partial PGHS-2 sequence 
25 can be obtained from any genomic library by library screening 
or from genomic DNA by PCR. Full cDNA sequences can be 
obtained by PCR of total RNA isolated from any cell or tissue 
that expresses PGHS-2 including, but not limited to, brain, 
heart and lung (where PGHS-2 is expressed without apparent 
30 inflammation) , as well as in many inflamed tissues such as 
synovial biopsies from rheumatoid arthritis. Cellular 
sources include, but are not limited to, primary and 
established cultures of fibroblasts, macrophages, endothelial 
cells, synoviocytes, vascular smooth muscle cells and 
35 astrocytes treated with growth factors, serum, inflammatory 
cytokines, calcium ionophores, or oncogenes, particularly if 
cycloheximide is included. 
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Alternatively, the cDNA libraries described above can be 
used to construct expression libraries in a cell line such as 
COS A2 which contains little or no autologous cyclooxygenase 
activity. These expression libraries can then be screened 
5 using antibodies which are specific to PGHS-2 and do not bind 
PGHS-1. Expression libraries for antibody screening may also 
be made in bacteria , such as E. coli t using phage vectors, 
such as lambda. Antibodies with specificity to PGHS-2 are 
commercially available through Cayman Chemical (Ann Arbor, 
10 MI), Oxford Biomedical Research, Inc. (Oxford, MI), and 

Transduction Laboratories (Lexington, KY) . These expression 
libraries may also be screened for PGHS-2 enzyme activity as 
set forth in the examples which are described in more detail 
infra . 

15 

5.2. EXPRESSING THE PGHS-2 GENE PRODUCT 

In order to express a biologically active PGHS-2, the 
coding sequence for the enzyme, a function equivalent, or a 
modified sequence, as described in Section 5.1., supra, is 

20 inserted into an appropriate eukaryotic expression vector, 
i.e., a vector which contains the necessary elements for 
transcription and translation of the inserted coding sequence 
in appropriate eukaryotic host cells which possess the 
cellular machinery and elements for the proper processing, 

25 i.e., signal cleavage, glycosylation, phosphorylation, 
sialylation, and protein sorting. Mammalian host cell 
expression systems are preferred for the expression of 
biologically active enzymes that are properly folded and 
processed. When administered in humans such expression 

30 products may also exhibit tissue targeting. 

The invention also encompasses peptide fragments of the 
PGHS-2 gene product. The PGHS-2 gene product or peptide 
fragments thereof, can be linked to a heterologous peptide or 
protein as a fusion protein. In addition, chimeric PGHS-2 

35 expressing a heterologous epitope that is recognized by a 
commercially available antibody is also included in the 
invention. A durable fusion protein may also be engineered; 
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i.e., a fusion protein which has a cleavage site located 
between the PGHS-2 sequence and the heterologous protein 
sequence, so that the PGHS-2 gene product, or fragment 
thereof, can be cleaved away from the heterologous moiety. 
5 For example, a collagenase cleavage recognition consensus 
sequence may be engineered between the PGHS-2 gene product, 
or fragment thereof, the heterologous peptide or protein. 
The PGHS-2 domain can be released from this fusion protein by 
treatment with collagenase. 

10 

5.2.1. CONSTRUCTION OP EXPRESSION VECTORS 
AND PREPARATION OF TRANSFECTANTS 

Methods which are well-known to those skilled in the art 

can be used to construct expression vectors containing the 

15 PGHS-2 coding sequence and appropriate transcriptional/ 

translational control signals. These methods include in 

vitro recombination/genetic recombination. See, for example, 

the techniques described in Sambook et al., 1987, Molecular 

Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, 

20 N.Y., Chapter 12. 

Human PGHS-1 or PGHS-2 proteins produced by these 

methods would be useful for in vitro studies on the mechanism 

of action of the human forms of PGHS-1 and PGHS-2 and 

particularly for further studies on the mechanism of action 

25 of any inhibitors that are selective for PGHS-2 or PGHS-1 

that are identified by drug screening with the stably 

expressing PGHS-2 or PGHS-1 cell lines, as infra, or for 

investigating the mechanism of action of existing drugs or of 

inhibitors that may be identified by other means. The 

30 purified human PGHS-2 or PGHS-1 proteins would also be useful 

for the production of crystals suitable for X-ray 

crystallography. Such crystals would be extremely beneficial 

for the rational design of drugs based on molecular 

structure. Although the crystal structure for ovine PGHS-1 

35 has been obtained, this information is not yet available for 

either human PGHS-1 or PGHS-2. Expression of these chimeric 

DNA constructs in a baculovirus or yeast system and 
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the techniques described in Sambook et al., 1987, Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, 

20 N.Y., Chapter 12. 

Human PGHS-1 or PGHS-2 proteins produced by these 
methods would be useful for in vitro studies on the mechanism 
of action of the human forms of PGHS-1 and PGHS-2 and 
particularly for further studies on the mechanism of action 

25 of any inhibitors that are selective for PGHS-2 or PGHS-1 
that are identified by drug screening with the stably 
expressing PGHS-2 or PGHS-1 cell lines, as infra, or for 
investigating the mechanism of action of existing drugs or of 
inhibitors that may be identified by other means. The 

30 purified human PGHS-2 or PGHS-1 proteins would also be useful 
for the production of crystals suitable for X-ray 
crystallography. Such crystals would be extremely beneficial 
for the rational design of drugs based on molecular 
structure. Although the crystal structure for ovine PGHS-1 
35 has been obtained, this information is not yet available for 
either human PGHS-1 or PGHS-2. Expression of these chimeric 
DNA constructs in a baculovirus or yeast system and 
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subsequent crystallization of the proteins would yield such 
data . 

A variety of eukaryotic host-expression systems may be 
used to express the PGHS-2 coding sequence. Although 
5 prokaryotic systems offer the distinct advantage of ease of 
manipulation and low cost of scale-up, their major drawback 
in the expression of PGHS-2 is their lack of proper post- 
translational modifications of expressed mammalian proteins. 
Eukaryotic systems, and preferably mammalian expression 

10 systems, allow for proper modification to occur. Eukaryotic 
cells which possess the cellular machinery for proper 
processing of the primary transcript glycosylation, 
phosphorylation, and, advantageously secretion of the gene 
product should be used as host cells for the expression of 

15 PGHS-2. Mammalian cell lines are preferred. Such host cell 
lines may include but are not limited to CHO, VERO, BHK, 
HeLa, COS, MDWCK, -293, WI38, etc. Alternatively, eukaryotic 
host cells which possess some but not all of the cellular 
machinery required for optional processing of the primary 

20 transcript and/ or post-translational processing and/or 

secretion of the gene product may be modified to enhance the 
host cell's processing capabilities. For example, a 
recombinant nucleotide sequence encoding a peptide product 
that performs a processing function the host cell had not 

25 previously been capable of performing, may be engineered into 
the host cell line. Such a sequence may either be co-trans- 
fected into the host cell along with the gene of interest, or 
included in the recombinant construct encoding the gene of 
interest. Alternatively, cell lines containing this sequence 

30 may be produced which are then transfected with the gene of 
interest. 

Appropriate eukaryotic expression vectors should be 
utilized to direct the expression of PGHS-2 in the host cell 
chosen. For example, at least two basic approaches may be 
35 followed for the design of vectors based on SV40. The first 
is to replace the SV40 early region with the gene of interest 
while the second is to replace the late region (Hammarskjold, 
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et al., 1986, Gene, 43:41-50. Early and late region 
replacement vectors can also be complemented in vitro by the 
appropriate SV40 mutant lacking the early or late region. 
Such complementation will produce recombinants which are 
5 packaged into infectious capsids and which contain the PGHS-2 
gene. A permissive cell line can then be infected to produce 
the recombinant protein. SV40-based vectors can also be used 
in transient expression studies, where best results are ob- 
tained when they are introduced into COS (CV-1, origin of 

10 SV40) cells, a derivative of CV-1 (green monkey kidney cells) 
which contain a single copy of an origin defective SV40 
genome integrated into the chromosome. These cells actively 
synthesize large T antigen (SV40) , thus initiating 
replication from any plasmid containing an SV40 origin of 

15 replication. 

In addition to SV40, almost every molecular ly cloned 
virus or retrovirus may be used as a cloning or expression 
vehicle. Viral vectors based on a number of retroviruses 
(avian and murine), adenoviruses, vaccinia virus (Cochran, et 

20 al., 1985, Proc. Natl. Acad. Sci. USA, 82:19-23) and polyoma 
virus may be used for expression. Other cloned viruses, such 
as J C (Howley, et al., 1980, J. Virol, 36:878-882), BK and 
the human papilloma viruses (Heilmsan, et al., 1980, J. 
Virol, 36:395-407), offer the potential of being used as 

25 eukaryotic expression vectors. For example, when using 

adenovirus expression vectors the PGHS-2 coding sequence may 
be ligated to an adenovirus transcript ion/ translation control 
complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted in the 

30 adenovirus genome by in vitro or in vivo recombination. 
Insertion in a non-essential region of the viral genome 
(e.g., region El or E3) will result in a recombinant virus 
that is viable and capable of expressing the human enzyme in 
infected hosts (e.g., see Logan & Shenk, 1984, Proc. Natl. 

35 Acad. Sci. USA, 81:3655-3659). Alternatively, the vaccinia 
virus 7.5K promoter may be used, (e.g., see, Hackett et al., 
1982, Proc. Natl. Acad. Sci. USA, 79:7415-7419; Hackett et 
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al., 1994, J. Virol. 49:857-864, Panicali et al., 1982, Proc. 
Natl. Acad. Sci. USA, 79:4927-4931). Of particular interest 
are vectors based on bovine papilloma virus (Sarver, et al., 
1981, Mol. Cell. Biol., 1:486-496), or Semliki Forest Virus, 
5 which provides large quantities of active protein in induced 
cells (Olkkohnen et al., 1994, Meth. Cell. Biol., 43 part 
A:43-53; Lundstrum et al., 1994, Eur. J. Biochem. , 224:917- 
921) . These vectors have the ability to replicate as 
extrachromosomal elements. Shortly after entry of this DNA 

10 into mouse cells, the plasmid replicates to about 100 to 200 
copies per cell. Transcription of the inserted cDNA does not 
require integration of the plasmid into the host f s 
chromosome, thereby yielding a high level of expression. 
These vectors can be used for stable expression by including 

15 a selectable marker in the plasmid, such as the neo gene. 
High level expression may also be achieved using inducible 
promoters such as the metallothionine IIA promoter, heat 
shock promoters, etc. 

For long-term, high-yield production of recombinant 

20 proteins, stable expression is preferred. For example, 

following the introduction of foreign DNA, engineered cells 
may be allowed to grow for 1-2 days an enriched media, and 
then are switched to a selective media. Rather than using 
expression vectors which contain viral origins of 

25 replication, host cells can be transformed with the PGHS-2 
DNA controlled by appropriate expression control elements 
(e.g., promoter, enhancer, sequences, transcription 
terminators, polyadenylation sites, etc.), and a selectable 
marker. The selectable marker in the recombinant plasmid 

30 confers resistance to the selection and allows cells to 

stably integrate the plasmid into their chromosomes and grow 
to form foci which in turn can be cloned and expanded into 
cell lines, A number of selection systems may be used, 
including but not limited to the herpes simplex virus 

35 thymidine kinase (Wigler, et al., 1977, Cell, 11:223-232), 
hypoxanthine-guanine phosphor ibosylatransf erase (Szybalska & 
Szybalski, 1962, Proc. Natl. Acad. Sci. USA, 48:2026), and 
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adenine phosphor ibosyltransf erase (Lowy, et al., 1980, Cell, 
22:817-823) genes can be employed in tk", hgprt" or aprt* cells 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for dhfr, which confers resistance to 
5 methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 
77:3567-3570; O'Hafe, et al., 1981, Proc. Natl. Acad. Sci. 
USA 78:1527-1531); ygpt, which confers resistance to 
mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. 
Sci. USA, 78:2072-2076); neo, which confers resistance to the 

10 aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. 
Biol., 150:1-14); and hygro, which confers resistance to 
hygromycim (Santerre, et al., 1994, Gene, 30:147-156) genes. 
Recently, additional selectable genes have been described, 
namely trpB, which allows cells to utilize indole in place of 

15 tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, 1988, Proc. Natl. 
Acad. Sci. USA, 85:8047-8051), and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor, 2- (dif luromethyl) -DL-ornithine, DFMO 

20 (McConlogue L. , 1987, In: Current Communications in Molecular 
Biology, Cold Spring Harbor Laboratory ed.). 

Alternative eukaryotic expression systems which may be 
used to express the PGHS-2 enzymes are yeast transformed with 
recombinant yeast expression vectors containing the PGHS-2 

25 coding sequence; insect cell system infected with recombinant 
virus expression vectors (e.gr., baculovirus) containing the 
PGHS-2 coding sequence; or plant cell systems infected with 
recombinant virus expression vectors (e.gr., cauliflower 
mosaic virus, CaMV; tobacco mosaic, TMV) or transformed with 

30 recombinant plasmid expression vectors (e.g., Ti plasmid) 
containing the PGHS-2 coding sequence. 

In yeast, a number of vectors containing constitutive or 
inducible promoters may be used. For a review see, Current 
Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et 

35 al., Greene Publish. Assoc. & Wiley Interscience, Ch. 13; 
Grant et al., 1987, Expression and Secretion Vectors for 
Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 31987, 



WO 96/40720 



PCT/US96/08311 



adenine phosphoribosyltransf erase (Lowy, et al., 1980, Cell, 
22:817-823) genes can be employed in tk", hgprt* or aprt" cells 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for dhfr, which confers resistance to 
5 methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 
77:3567-3570; O'Hare, et al., 1981, Proc. Natl. Acad. Sci. 
USA 78:1527-1531); ygpt, which confers resistance to 
mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. 
Sci. USA, 78:2072-2076); neo, which confers resistance to the 

10 aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. 
Biol., 150:1-14); and hygro, which confers resistance to 
hygromycim (Santerre, et al., 1994, Gene f 30:147-156) genes. 
Recently, additional selectable genes have been described, 
namely trpB, which allows cells to utilize indole in place of 

15 tryptophan; hisD, which allows cells to utilize histinol in 
place of histidine (Hartman & Mulligan, 1988, Proc. Natl. 
Acad. Sci. USA, 85:8047-8051), and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor, 2- (dif luromethyl) -DL-ornithine, DFMO 

20 (McConlogue L. , 1987, In: Current Communications in Molecular 
Biology, Cold Spring Harbor Laboratory ed.). 

Alternative eukaryotic expression systems which may be 
used to express the PGHS-2 enzymes are yeast transformed with 
recombinant yeast expression vectors containing the PGHS-2 

25 coding sequence; insect cell system infected with recombinant 
virus expression vectors (e.gr., baculovirus) containing the 
PGHS-2 coding sequence; or plant cell systems infected with 
recombinant virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV; tobacco mosaic, TMV) or transformed with 

30 recombinant plasmid expression vectors (e.g.*, Ti plasmid) 
containing the PGHS-2 coding sequence. 

In yeast, a number of vectors containing constitutive or 
inducible promoters may be used. For a review see, Current 
Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et 

35 al., Greene Publish. Assoc. & Wiley Interscience, Ch. 13; 
Grant et al., 1987, Expression and Secretion Vectors for 
Yeast, in Methods in Enzymology, Eds. Wu & Grossman, 31987, 

- 23 - 



WO 96/40720 



PO7US96/08311 



Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA 
Cloning, Vol. II, IRL Press, Wash., D.C., Ch. 3; Bitter, 
1987, Heterologous Gene Expression in Yeast, Methods in 
Enzymology, Eds. Berger & Kimmel Acad. Press, N.Y., Vol. 152, 
5 pp. 673-694; and The Molecular Biology of the Yeast 

Saccharomyces, 1982, Eds. Strathem et al., Cold Spring Harbor 
Press, Vols. I and II. For complementation assays in yeast, 
cDNAs for PGHS-2 may be cloned into yeast episomal plasmids 
(YEp) which replicate autonomously in yeast due to the 

10 presence of the yeast 2/x circle. The cDNA may be cloned 
behind either a constitutive yeast promoter such as ADH or 
LEU2 or an inducible promoter such as GAL (Cloning in Yeast, 
Chpt. 3, R. Roths tein In: DNA Cloning Vol. 11, A Practical 
Approach, Ed. DM Glover, 1986, IRL Press, Wash., D.C- ). 

15 constructs may contain the 5» and 3 1 non-translated regions 
of the cognate PGHS-2 mRNA or those corresponding to a yeast 
gene. YEp plasmids transform at high efficiency and the 
plasmids are extremely stable. Alternatively, vectors may.be 
used which promote integration of foreign DNA sequences into 

20 the yeast chromosome. 

Alternately, active, post-translationally modified human 
PGHS-1 and PGHS-2 proteins can be obtained using a yeast 
expression system such as the Pichia pastoris expression 
system marketed by Invitrogen (Pichia pastoris is owned and 

25 licensed by Research Corporation Technologies, Tucson, AZ; 
however, all components are available from Invitrogen, San 
Diego, CA) . In this example, cDNAs encoding human PGHS-2 and 
PGHS-1 are independently cloned into the pHIL-D2 Pichia 
expression vector. After linearization with a restriction 

30 endonuclease, these constructs are transfected into 

spheroblasts of the his4 Pichia pastoris strain, GS115, and 
recombinant yeast carrying the cloned PGHS-1 or. PGHS-2 DNA 
sequences are identified by screening for yeast clones that 
grow in the absence of histidine (now supplied by the 

35 recombinant vector) , but do not efficiently utilize methanol 
as the sole carbon source (due to the presence of PGHS-1 or 
PGHS-2 in the place of AOXJ gene sequence coding for methanol 
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utilization) . After expansion of such clones in the presence 
of an alternative carbon source such as glycerol, large 
quantities of cells would be transferred to liquid media 
containing methanol where replication ceases. However, cells 
5 remain viable for many days during which time human PGHS-1 or 
PGHS-2 proteins are specifically expressed at high levels 
under control of the AOXI promoter. The advantages of this 
system include very high protein yields and lower expense in 
the production and maintenance of cultures. 

10 m cases where plant expression vectors are used, the 

expression of the PGHS-2 coding sequence may be driven by any 
of a number of promoters. For example, viral promoters such 
as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al., 
1984, Nature, 310:511-514), or the coat protein promoter of 

15 TMV (Takamatsu et al., 1987, EMBO J., 6:307-311) may be used; 
alternatively, plant promoters such as the small subunit of 
RUBISCO (Coruzzi et al., 1994, EMBO J., 3:1671-1680; Broglie 
et al., 1984, Science, 224:838-843); or heat shock promoters, 
eg., soybean hsp 17.5-E or hsp 17.3-B (Gurley et al., 1986, 

20 Mol. Cell. Biol., 6:559-565) may be used. These constructs 
can be introduced into plant cells using Ti plasmids, Ri 
plasmids, plant virus vectors; direct DNA transformation; 
microinjection, electroporation, etc. For reviews of such 
techniques see, for example, Weissbach & Weissbach, 1988, 

25 Methods for Plant Molecular Biology, Academic Press, NY, 

Section VIII, pp. 421-463; and Grierson & Corey, 1988, Plant 
Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9. 

An alternative expression system which could be used to 
express PGHS-2 is an insect system. In one such system, 

30 Autographa californica nuclear polyhedrosis virus (AcNPV) is 
used as a vector to express foreign genes. The virus grows 
in Spodoptera frugiperda -cells. The PGHS-2 sequence may be 
cloned into non-essential regions (for example the polyhedrin 
gene) of the virus and placed under control of an AcNPV 

35 promoter (for example the polyhedrin promoter) . Successful 
insertion of the coding sequence will result in inactivation 
of the polyhedrin gene and production of non-occluded re- 
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combinant virus (i.e., virus lacking the proteinaceous coat 
coded for by the polyhedrin gene) . These recombinant viruses 
are then used to infect Spodoptera frugiperda cells in which 
the inserted gene is expressed, (e.g., see Smith et al., 
5 1983, J. Virol., 46:584, Smith, U.S. Pat. No. 4,215,051). 

In a specific embodiment of an insect system, the DNA 
encoding human PGHS-2 or PGHS-1 can be independently cloned 
into the pBlueBacIII recombinant transfer vector (Invitrogen, 
San Diego, CA) downstream of the polyhedrin promoter and 

10 transfected into Sf 9 insect cells (derived from Spodoptera 
frugiperda ovarian cells, available from Invitrogen, San 
Diego, CA) to generate recombinant virus containing human 
PGHS-1 or PGHS-2. After plaque purification of the 
recombinant virus high-titer viral stocks are prepared that 

15 in turn would be used to infect Sf 9 or High Five™ 

(BTI-TN-5B1-4 cells derived from Trichoplusia ni egg cell 
homogenates; available from Invitrogen, San Diego, CA) insect 
cells, to produce large quantities of appropriately post- 
translationally modified PGHS-1 or PGHS-2 proteins. Although 

20 it is possible that these cells themselves could be directly 
useful for drug assays, the PGHS-1 or PGHS-2 proteins 
prepared by this method can be used for in vitro assays of 
drug potency and selectivity. 

25 5.2.2. IDENTIFICATION OF TRANSFECTANTS 

OR TRANSFORHANTS EXPRESSING 
THE PGHS-2 GENE PRODUCT 

The host cells which contain the PGHS-2 coding sequence 

and which express the biologically active gene product may be 

identified by at least four general approaches: (a) DNA-DNA 

30 

or DNA-RNA hybridization; (b) the presence or absence of 
"marker" gene functions; (c) assessing the level of 
transcription as measured by the expression of PGHS-2 mRNA 
transcripts in the host cell; and (d) detection of the gene 
product as measured by immunoassay or by its biological 

35 

activity. 
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and which express the biologically active gene product may be 

identified by at least four general approaches: (a) DNA-DNA 

30 

or DNA-RNA hybridization; (b) the presence or absence of 
"marker" gene functions; (c) assessing the level of 
transcription as measured by the expression of PGHS-2 mRNA 
transcripts in the host cell; and (d) detection of the gene 

product as measured by immunoassay or by its biological 

35 _ . . _ 
activity. 
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In the first approach, the presence of the PGHS-2 coding 
sequence inserted in the expression vector can be detected by 
DNA-DNA or DNA-RNA hybridization or PCR using probes 
comprising nucleotide sequences that are homologous to the 
5 mouse PGHS-2 coding sequence [SEQ ID NO:l] or human PGHS-2 
coding sequence [SEQ ID NO: 3] substantially as shown in 
Figures 1 and 6A-6B, or portions or derivatives thereof. 

In the second approach, the recombinant expression 
vector/host system can be identified and selected based upon 

10 the presence or absence of certain "marker" gene functions 
(e.g., resistance to antibiotics, resistance to methotrexate, 
transformation phenotype, occlusion body formation in 
baculovirus, etc.). For example, if the PGHS-2 coding 
sequence is within a marker gene sequence of the vector, 

15 recombinants containing the PGHS-2 coding sequence can be 
identified by the absence of the marker gene function. 
Alternatively, a marker gene can be placed in tandem with the 
PGHS-2 sequence under the control of the same or different., 
promoter used to control the expression of the PGHS-2 coding 

20 sequence. Expression of the marker in response to induction 
or selection indicates expression of the PGHS-2 coding 
sequence. In addition, the marker gene may be identified by 
DNA-DNA or DNA-RNA hybridization or PCR. 

In the third approach, transcriptional activity for the 

25 PGHS-2 coding region can be assessed by hybridization or PCR 
assays. For example, RNA can be isolated and analyzed by 
Northern blot using a probe homologous to the PGHS-2 coding 
sequence or particular portions thereof substantially as 
shown in Figure 1 (murine, [SEQ ID N0:1]) or Figures 6A-6B 

30 (human, SEQ ID NO: 3]). Alternatively, total nucleic acids of 
the host cell may be extracted and assayed for hybridization 
to such probes. 

In the fourth approach, the expression of the PGHS-2 
protein product can be assessed immunologically, for example 

35 by Western blots, immunoassays such as radioimmuno- 

precipitation, enzyme-linked immunoassays and the like. The 
ultimate test of the success of the expression system, 
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however, involves the detection of the biologically active 
PGHS-2 gene product. Where the host cell secretes the gene 
product, the cell free media obtained from the cultured 
transfectant host cell may be assayed for PGHS-2 activity. 
5 Where the gene product is not secreted, cell ly sates may be 
assayed for such activity. In either case, a number of 
assays can be used to detect PGHS-2 activity including but 
not limited to the following: cyclooxygenase activity may be 
determined in the culture medium by the addition of exogenous 
10 arachidonic acid substrate (30 jiM for 15 min. at 37 °C) 

followed by conversion of the prostaglandin product to a 
methyl oximate form. This derivative may then be quantitated 
by radioimmunoassay (kit from Amersham Corp.) 

15 CELL LINES EXPRESSING PGHS-1 OR PGHS-2 

The present invention also relates to cell lines 
containing recombinant DNA sequence, preferably a 
chromosomally integrated recombinant DNA sequence, which 
comprises a gene encoding the regulated inflammatory 

20 cyclooxygenase griPGHS or "PGHS-2 11 which cell lines further 
do not express autologous PGHS-1 or PGHS-2, apart from that 
encoded by the recombinant DNA sequence. The recombinant DNA 
also does not encode constitutive PGHS-1 (EC 1.14.99.1). 
A specific embodiment of the present invention is an 

25 engineered mammalian cell line which contains a chromosomally 
integrated, genetically-engineered ("recombinant") DNA 
sequence, which DNA sequence expresses mammalian, preferably 
human, PGHS-2, but does not express constitutive mammalian 
PGHS-1, and wherein said cell line also does not express 

30 autologous PGHS-1 or PGHS-2. The cell line is preferably of 
human or primate origin, such as the exemplified monkey 
kidney COS cell line, but cell lines derived from other 
species may be employed, including chicken, hamster, murine, 
ovine and the like; the CHO (Chinese hamster ovary) cell line 

35 for example, may be particularly preferred for large scale 
production. 
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encoded by the recombinant DNA sequence. The recombinant DNA 
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25 engineered mammalian cell line which contains a chromosomally 
integrated, genetically-engineered ("recombinant" ) DNA 
sequence, which DNA sequence expresses mammalian, preferably 
human, PGHS-2, but does not express constitutive mammalian 
PGHS-1, and wherein said cell line also does not express 

30 autologous PGHS-1 or PGHS-2. The cell line is preferably of 
human or primate origin, such as the exemplified monkey 
kidney COS cell line, but cell lines derived from other 
species may be employed, including chicken, hamster, murine, 
ovine and the like; the CHO (Chinese hamster ovary) cell line 

35 for example, may be particularly preferred for large scale 
production. 



28 - 



WO 96/40720 



PCT/US96/08311 



Any cell or cell line, the genotype of which has been 
altered by the presence of a recombinant DNA sequence is 
encompassed by the invention. The recombinant DNA sequence 
may also be referred to herein as "heterologous DNA," 
5 "exogenous DNA," "genetically engineered" or "foreign DNA," 
indicating that the DNA was introduced into the genotype or 
genome of the cell or cell line by a process of genetic 
engineering. 

The invention includes, but is not limited to, a cell or 
10 cell line wherein the native PGHS-2 DNA sequence has been 
removed or replaced as a result of interaction with a 
recombinant DNA sequence. Such cells are called PGHS-2 
knockouts, herein, if the resulting cell is left without a 
native DNA that encodes a functional PGHS-2 gene product. 
15 As used herein, the term "recombinant DNA sequence" 

refers to a DNA sequence that has been derived or isolated 
from any source, that may be subsequently chemically altered, 
and later introduced into mammalian cells. An example of a 
recombinant DNA sequence "derived" from a source, would be a 
20 DNA sequence that is identified as a useful fragment within a 
given organism, and which is then chemically synthesized in 
essentially pure form. An example of such DNA sequence 
"isolated" from a source would be a DNA sequence that is 
excised or removed from said source by chemical means, e.g., 
25 by the use of restriction endonucleases, so that it can be 
further manipulated, e.g., amplified, for use in the 
invention, by the methodology of genetic engineering. 

Therefore, "recombinant DNA sequence" includes 
completely synthetic DNA, semi-synthetic DNA, DNA isolated 
30 from biological sources, and DNA derived from introduced RNA. 
Generally, the recombinant DNA sequence is not originally 
resident in the genotype which is the recipient of the DNA 
sequence, or it is resident in the genotype but is not 
expressed. 

35 The isolated recombinant DNA sequence used for 

transformation herein may be circular or linear, double- 
stranded or single-stranded. Generally, the DNA sequence is 
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chimeric linear DNA, or is a plasmid or viral expression 
vector, that can also contain coding regions flanked by 
regulatory sequences which promote the expression of the 
recombinant DNA present in the resultant cell line* For 
5 example, the recombinant DNA sequence may itself comprise or 
consist of a promoter that is active in mammalian cells, or 
may utilize a promoter already present in the genotype that 
is the transformation target. Such promoters include the CMV 
promoter depicted in Figure 4, as well as the SV 40 late 
10 promoter and retroviral LTRs (long terminal repeat elements). 
The general methods for constructing recombinant DNA 
which can transform target cells are well known to those 
skilled in the art, and the same compositions and methods of 
construction may be utilized to produce the DNA useful 
15 herein. For example, J. Sambrook et al., Molecular Cloning; 
A Laboratory Manual, Cold Spring Harbor Laboratory Press (2d 
ed., 1989), provides suitable methods of construction. 

Aside from recombinant DNA sequence that serve as 
transcription units for PGHS-1, PGHS-2 or other portions 
20 thereof, a portion of the recombinant DNA may be 

untranscribed, serving a regulatory or a structural function. 

The recombinant DNA sequence to be introduced into the 
cells further will generally contain either a selectable 
marker gene or a reporter gene or both to facilitate 
25 identification and selection of transformed cells. 

Alternatively, the selectable marker may be carried on a 
separate piece of DNA and used in a co-transformation 
procedure. Both selectable markers and reporter genes may be 
flanked with appropriate regulatory sequences to enable 
30 expression in mammalian cells. Useful selectable markers are 
well known in the art and include, for example, anti-biotic 
and herbicide resistance genes. 

Sources of DNA sequences useful in the present invention 
include Poly-A RNA from mammalian cells, from which the about 
35 4 kb mRNA encoding PGHS-2 can be derived and used for the 
synthesis of the corresponding cDNA by methods known to the 
art* Such sources include the lambda ZAP II (Stratagene) 
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library of size fractionated poly-A RNA isolated from C127 
murine fibroblasts treated with serum and cycloheximide as 
described by O'Banion et al., 1991, J. Biol. Chem. , 
266:23261-23267. Xie et al. obtained mRNA encoding chicken 
5 PGHS-2 as described in 1991, Proc. Nat f l. Acad. Sci. USA, 
88:2692-2696. Sources of human mRNA encoding PGHS-2 include 
RNA from human monocytes treated with interleukin-1 and 
cycloheximide, in accord with O'Banion et al., 1992, Proc. 
Nat 1 !. Acad. Sci. USA, 89:4888-4892. Sources of human mRNA 
10 encoding PGHS-1 are also well known to the art. 

Selectable marker genes encoding enzymes which impart 
resistance to biocidal compounds are listed in Table 1, 
below. 

Table 1 

15 Selectable Marker Genes 



Resistance 
flene or Enzyme 

Neomycin phospho- 
transferase (neo) 

20 

Hygromycin 
phosphotrans- 
ferase (hpt or 
hyg) 

25 Dihydrofolate 
reductase (dhfr) 



Phosphinothricin 
acetyltransf erase 
(bar) 

30 2,2-Dichloropro- 
pionic acid 
dehalogenase 



Acetohydroxyacid 
synthase 

35 



Confers 
Resistance to: 

G-418, neomycin, 
kanamycin 



Hygromycin B 



Methotrexate 



Phosphinothricin 



2-2 , Dichloropro- 
pionic acid 
(Dalapon) 



Sulfonylurea, 
imidazolinone and 
triazolopyrimidine 
herbicides 



Reference 

Southern et 

al., 1982, J. Mol. 

Appl. Gen., 1:327-341 



Shimizu et al. , 
Mol. Cell Biol. , 
6:1074-1087 



1986, 



Kwok et al. , 1986, 
Proc. Nat'l. Acad. 
Sci. USA, 4552-4555 

DeBlock et al., 1987, 
EMBO J., 6:2513-2518 



Buchanan-Wollaston et 
al., 1989, J. Cell. 
Biochem. , Supp. 13D, 
330 

Anderson et al. (U.S. 
Patent No. 
4,761,373) ; G.W. 
Haughn et al., 1988 
Mol. Gen. Genet., 
211:266-271 
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5 - Enolpyruvy 1 - 
shikimatephos- 
phate synthase 
(aroA) 



Glyphosate 



Haloarylnitrilase Bromoxynil 



Acetyl-coenzyme A 
carboxylase 



10 Dihydropteroate 
synthase (sul I) 



32 kD photosystem 
II polypeptide 
(psbA) 

15 Anthranilate 
synthase 

Dihydrodipicolin- 
ic acid synthase 
(dap A) 

20 



Sethoxydim, 
haloxyfop 



Sulfonamide 
herbicides 



Triazine herbicides 



5 -Me thy 1 tryptophan 



Aminoethyl cysteine 



Comai et al., 1985 
Nature, 317:741-744 



Stalker et al. , 
published PCT appln. 
W087/04181 

Parker et al., 1990 
Plant Physiol. , 
92:1220 

Guerineau et al., 
1990, Plant Molec. 
Biol., 15:127-136 

Hirschberg et al., 
1983, Science, 
222:1346-1349 

Hibberd et al. (U.S. 
Patent No. 4,581,847) 

Glassman et al., 
published PCT 
application No. 
W089/11789 



Reporter genes are used for identifying potentially 
transformed cells and for evaluating the functionality of 
regulatory, sequences. Reporter genes which encode for easily 
assayable marker proteins are well known in the art. In 
general, a reporter gene is a gene which is not present in or 
expressed by the recipient organism or tissue and which 
encodes a protein whose expression is manifested by some 
easily detectable property, e.g., enzymatic activity. 
Preferred genes includes the chloramphenicol acetyl 
transferase gene (cat) from Tn9 of E. coli, the beta- 
galactosidase gene of E. coli, the beta-glucuronidase gene 
(gus) of the uidA locus of E. coli, and the lucif erase gene 
from firefly Photinus pyralis. Expression of the reporter 
gene is assayed at a suitable time after the DNA has been 
introduced into the recipient cells. 
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galactosidase gene of E. coli, the beta-glucuronidase gene 
(gus) of the uidA locus of E. coli, and the lucif erase gene 
from firefly Photinus pyralis. Expression of the reporter 

35 gene is assayed at a suitable time after the DNA has been 
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Other elements such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of 
the recombinant DNA sequence. Such elements may or may not 
be necessary for the function of the DNA, but may provide 
5 improved expression of the DNA by affecting transcription, 
stability of the mRNA, or the like. Such elements may be 
included in the DNA as desired to obtain the optimal 
performance of the transforming DNA in the cell. 

The recombinant DNA sequence can be readily introduced 

10 into the target cells by transfection with an expression 
vector, such as a viral expression vector, comprising cDNA 
encoding PGHS-2 or PGHS-1 by the modified calcium phosphate 
precipitation procedure of Chen et al., 1987, Mol. Cell. 
Biol., 7:2745-2752. Transfection can also be accomplished by 

15 other methods, including lipofection, using commercially 
available kits, e.g., provided by Life Technologies. 

In a preferred embodiment of the invention, the cell 
lines of the invention are able to express a stable PGHS-2 
gene product or analog, homologue, or deletion thereof after 

20 several passages through cell culture. While the in- 
stability of the PGHS-2 gene product has been hypothesized to 
be attributable to the 3 f non-coding region of the PGHS-2 
mRNA, it has been found that even cell lines which do not 
include this 3* region are often unable to express a stable 

25 PGHS-2 gene product for more than approximately five (5) 
passages in cell culture. The cell lines of the invention, 
however, are able to continue to produce a stable PGHS-2 gene 
product even after at least 5, 10, 15, or 20 passages through 
cell culture. The cell lines of the invention were selected 

30 by the single cell cloning of those cells which were able to 
continue to stably produce PGHS-2 even after the mere five 
passages through cell culture which defined the expressing 
limit of the cells of the prior art. 

35 5.2.4. PURIFICATION OP THE PGHS-2 GENE 

PRODUCT 
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Once a cell that produces high levels of biologically 
active PGHS-2 is identified, the cell may be clonally 
expanded and used to produce large quantities of the enzyme, 
which may be purified using techniques well-known in the art 
5 including, but not limited to, immunoaf f inity purification, 
chromatographic methods including high performance liquid 
chromatography and the like. Where the enzyme is secreted by 
the cultured cells, PGHS-2 may be readily recovered from the 
culture medium. 

10 Where the PGHS-2 coding sequence, or fragment thereof, 

has been engineered to encode a cleavable fusion protein, the 
purification of the PGHS-2 gene product, or fragment thereof, 
may be readily accomplished using affinity purification 
techniques. For example, an antibody specific for the 

15 heterologous peptide or protein can be used to capture the 
durable fusion protein; for example, on a solid surface, a 
column etc. The PGHS-2 moiety can be released by treatment 
with the appropriate enzyme that cleaves the linkage site. - 
cDNA construction using the polymerase chain reaction 

20 accompanied by transfection and purification of the expressed 
protein permits the isolation of sufficient quantities of 
PGHS-2 for characterization of the enzyme's physical and 
kinetic properties. Using site-directed mutagenesis or 
naturally occurring mutant sequences, this system provides a 

25 reasonable approach to determine the effects of the altered 
primary structure on the function of the protein. Fusion 
constructs of the PGHS-2 protein domain with the marker 
peptide preceding the amino terminus of PGHS-2 or following 
the carboxy terminus of PGHS-2 may also be engineered to 

30 evaluate which fusion construct will interfere the least, if 
at all, with the protein's biologic function and the ability 
to be purified. 

Using this aspect of the invention, any cleavage site or 
enzyme cleavage substrate may be engineered between the PGHS- 

35 2 sequence and a second peptide or protein that has a binding 
partner which could be used for purification, e.g, any 
antigen for which an immunoaf f inity column can be prepared. 
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5.3. ANTIBODIES TO THE PGHS-2 GENE PRODUCT 
For the production of antibodies, various host animals 
may be immunized by injection with the PGHS-2 gene product, 
or a portion thereof including, but not limited to, portions 
5 of the PGHS-2 gene product in a recombinant protein. Such 
host animals may include but are not limited to rabbits, 
mice, and rats, to name but a few. Various adjuvants may be 
used to increase the immunological response, depending on the 
host species, including but not limited to Freund's (complete 

10 and incomplete) , mineral gels such as aluminum hydroxide, 
surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacille Calmette-Guerin) and 

15 Corynebacterium pairvum. 

Monoclonal antibodies may be prepared by using any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These incltide 
but are not limited to the hybridoma technique originally 

20 described by Kohler and Milstein, 1975, Nature, 256:495-497, 
the human B-cell hybridoma technique (Kosbor et al., 1983, 
Immunology Today, 4:72, Cote et al., 1983, Proc. Natl. Acad. 
Sci., 80:2026-2030) and the EBV-hybridoma technique (Cole et 
al., 1985, Monoclonal Antibodies and Cancer Therapy, Alan R. 

25 Liss, Inc., pp. 77-96). In addition, techniques developed 
for the production of "chimeric antibodies 11 (Morrison et al., 
1984, Proc. Natl. Acad. Sci., 81:6851-6855; Neuberger et al. , 
1984, Nature, 312:604-608; Takeda et al., 1985, Nature, 
314:452-454) by splicing the genes from a mouse antibody 

30 molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate 
biological activity can" be used. Alternatively/ techniques 
described for the production of single chain antibodies (U.S. 
Patent No. 4,946,778) can be adapted to produce single chain 

35 antibodies specific to one of the binding partners. 

Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such 
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fragments include but are not limited to: the F(ab f ) 2 
fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab') 2 
5 fragments. Alternatively, Fab expression libraries may be 
constructed (Huse et al., 1989, Science, 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

10 5.4. DIAGNOSTICS 

The DNA of the invention encoding the PGHS-2 gene or 
homologues, analogues, or fragments' thereof may be used in 
accordance with the invention to diagnose disease states 
which are phenotypic of an aberrant PGHS-2 genotype or of 

15 aberrant PGHS-2 expression. 

For example, but not by way of limitation, in pulmonary 
fibrosis from radiation or chronic pulmonary disease, and in 
the skin disorder scleroderma, only a small percentage of " 
those afflicted respond to glucocorticoids, McCune et al., 

20 1994, Curr. Opin. Rheum., 6 (3) : 262-272 ; Muir and Benhamou, 
1994, [French] Annales de Med. Intern., 145 (Suppl) :34-36; 
Labrune and Huchon, 1991, [French] Revue du Praticien, 
41(14) :1275-1277. These two disorders have been associated, 
Steen et al., 1994, Arthritis & Rheum., 37 (9) : 1290-1296; 

25 wells et al., 1994, Am. J. Resp. & Crit. Care Med., 149(6) 
1583-1590. Therefore, both these disorders may be 
characterized by a constitute over expression of PGHS-2 or by 
excessive longevity of the PGHS-2 message which, in either 
case, is not diminished by glucocorticoid. 

30 By way of another example, but not by way of limitation, 

many tumors may be characterized by a lack of, or excess of, 
PGHS-2 activity which may stem from mutations in the PGHS-2 
coding or regulatory sequence. 

In both of the examples above, afflicted cells, tissue 

35 sections, or biopsy specimens may be screened with the PGHS-2 
DNA sequences of the invention and isolated PGHS-2 sequenced 
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to determine which mutations in PGHS-2 are associated with 
the diseases. The DNAs of the invention may also be used to 
determine whether an individual carries an aberrant PGHS-2 
gene. 

5 In a specific embodiment of the invention , the detection 

of the aberrant PGHS-2 DNA is conducted by PCR amplification 
from a small tissue sample. Detection may also be via in 
situ hybridization or immunocytochemistry of pathology or 
biopsy specimens. 

10 

5.5. GENE THERAPIES BASED ON THE PGHS-2 GENE 
A variety of gene therapy approaches may be used in 
accordance with the invention to modulate expression of the 
PGHS-2 gene in vivo. For example, antisense DNA molecules 

15 may be engineered and used to block translation of PGHS-2 
mRNA in vivo. Alternatively, ribozyme molecules may be 
designed to cleave and destroy the PGHS-2 mRNAs in vivo. In 
another alternative, oligonucleotides designed to hybridize 
to the 5 9 region of the PGHS-2 gene (including the region 

20 upstream of the coding sequence) and form triple helix 

structures may be used to block or reduce transcription of 
the PGHS-2 gene. In yet another alternative, nucleic acid 
encoding the full length wild-type PGHS-2 message may be 
introduced in vivo into cells which otherwise would be unable 

25 to produce the wild-type PGHS-2 gene product in sufficient 
quantities or at all. 

In a preferred embodiment, the antisense, ribozyme and 
triple helix nucleotides are designed to inhibit the 
translation or transcription of PGHS-2 with minimal effects 

30 on the expression of PGHS-1. To accomplish this, the 
oligonucleotides used should be designed on the basis of 
relevant sequences unique to PGHS-2; i.e., those sequences 
found in PGHS-2 and not in PGHS-1. 

For example, and not by way of limitation, the 
35 oligonucleotides should not fall within those region where 
the nucleotide sequence of PGHS-2 is most homologous to that 
of PGHS-1 (see Figs. 10A-10D) , or the PGHS-2 sequence which 
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is shown in Figure 10 to be identically conserved between 
PGHS-1 and PGHS-2. These sequences include the following 
regions in the nucleotide sequence of PGHS-2: 

427-457 

5 555-601 

624-646 
822-901 
975-997 
1116-1154 

10 1251-1282 

1596-1634 

Instead, it is preferred that the oligonucleotides fall 
within the following regions of PGHS-2, which are shown in 
Figs. 10A-10D to diverge from the sequence of PGHS-1. These 
15 sequences include the following regions in the nucleotide 
sequence of PGHS-2: 

171-254 
299-340 
486-512 

20 602-623 

1214-1250 
1283-1346 
1521-1580 
1718-1834 

25 in the case of antisense molecules, it is preferred that 

the sequence be chosen from the list above. It is also 
preferred that the sequence be at least 18 nucleotides in 
length in order to achieve sufficiently strong annealing to 
the target xnRNA sequence to prevent translation of the 

30 sequence. Izant and Weintraub, 1984, Cell, 36:1007-1015; 
Rosenberg et al., 1985, Nature, 313:703-706. 

In the case of the "hammerhead" type of ribozymes, it is 
also preferred that the target sequences of the ribozymes be 
chosen from the list above. Ribozymes are RNA molecules 

35 which possess highly specific endoribonuclease activity. 
Hammerhead ribozymes comprise a hybridizing region which is 
complementary in nucleotide sequence to at least part of the 
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target RNA, and a catalytic region which is adapted to cleave 
the target RNA. The hybridizing region contains nine (9) or 
more nucleotides. Therefore, the hammerhead ribozymes of the 
present invention have a hybridizing region which is 
5 complementary to the sequences listed above and is at least 
nine nucleotides in length • The construction and production 
of such ribozymes is well known in the art and is described 
more fully in Haseloff and Gerlach, 1988, Nature, 334:585- 
591. 

10 The ribozymes of the present invention also include RNA 

endoribonucleases (hereinafter "Cech-type ribozymes 11 ) such as 
the one which occurs naturally in Tetrahymena Thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been 
extensively described by Thomas Cech and collaborators (Zaug, 

15 et al., 1984, Science, 224:574-578; Zaug and Cech, 1986, 
Science, 231:470-475; Zaug, etal., 1986, Nature, 324:429- 
433; published International patent application No. WO 
88/04300 by University Patents Inc.; Been and Cech, 1986, 
Cell, 47:207-216). The Cech endoribonucleases have an eight 

20 base pair active site which hybridizes to a target RNA 

sequence whereafter cleavage of the target RNA takes place. 
The invention encompasses those Cech-type ribozymes which 
target eight base-pair active site sequences that are present 
in PGHS-2 but not PGHS-1. 

25 in the case of oligonucleotides that hybridize to and 

form triple helix structures at the 5' terminus of the PGHS-2 
gene and can be used to block transcription, it is preferred 
that they be complementary to those sequences in the 5' 
terminus of PGHS-2 which are not present in PGHS-1 (see Figs. 

30 HA-11C) . Because of the lack of homology between these 

regions of PGHS-2 and PGHS-1, any sequence sufficiently long 
to hybridize to the PGHS-2 promoter will not hybridize to the 
promoter of PGHS-1. However, it is preferred that the 
sequences not include those regions of the PGHS-2 promoter 

35 which are even slighly homologous to that of PGHS-1. These 
slightly homologous sequences include the following regions 
in the nucleotide sequence of the PGHS-2 promoter: 
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382-438 
669-696 
797-826 
856-885 

5 980-1008 

1142-1170 
1204-1252 
1863-1898 
2013-2101 

10 2126-2175 

2356-2396 

The foregoing compounds can be administered by a variety 
of methods which are known in the art including, but not 
limited to the use of liposomes as a delivery vehicle. Naked 

15 DNA or RNA molecules may also be used where they are in a 
form which is resistant to degradation such as by 
modification of the ends, by the formation of circular 
molecules, or by the use of alternate bonds including 
phosphothionate and thiophosphoryl modified bonds. In 

20 addition, the delivery of nucleic acid may be by facilitated 
transport where the nucleic acid molecules are conjugated to 
poly- lysine or transferrin. Nucleic acid may also be 
transported into cells by any of the various viral carriers, 
including but not limited to, retrovirus, vaccinia, AAV, and 

25 adenovirus. 

Alternatively, a recombinant nucleic acid molecule which 
encodes, or is, such antisense, ribozyme, triple helix, or 
PGHS-2 molecule can be constructed. This nucleic acid 
molecule may be either RNA or DNA. If the nucleic acid 

30 encodes an RNA, it is preferred that the sequence be 
operatively attached to a regulatory element so that 
sufficient copies of the desired RNA product are produced. 
The regulatory element may permit either constitutive or 
regulated transcription of the sequence. In vivo, that is, 

35 within the cells or cells of an organism, a transfer vector 
such as a bacterial plasmid or viral RNA or DNA, encoding one 
or more of the RNAs, may be transfected into cells e.g. 
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including but not limited to, retrovirus, vaccinia, AAV, and 

25 adenovirus. 

Alternatively, a recombinant nucleic acid molecule which 
encodes, or is, such antisense, ribozyme, triple helix, or 
PGHS-2 molecule can be constructed. This nucleic acid 
molecule may be either RNA or DNA. If the nucleic acid 

30 encodes an RNA, it is preferred that the sequence be 
operatively attached to a regulatory element so that 
sufficient copies of the desired RNA product are produced. 
The regulatory element may permit either constitutive or 
regulated transcription of the sequence. In vivo, that is, 

35 within the cells or cells of an organism, a transfer vector 
such as a bacterial plasmid or viral RNA or DNA, encoding one 
or more of the RNAs, may be transfected into cells e.g. 

- 40 - 



WO 96/40720 



PCT/US96/08311 



(Llewellyn et al., 1987, J, Mol. Biol. f 195:115-123; Hanahan 
et al. 1983, J. Mol. Biol., 166:557-580). Once inside the 
cell, the transfer vector may replicate, and be transcribed 
by cellular polymerases to produce the RNA or it may be 
5 integrated into the genome of the host cell. Alternatively, 
a transfer vector containing sequences encoding one or more 
of the RNAs may be transfected into cells or introduced into 
cells by way of micromanipulation techniques such as 
microinjection, such that the transfer vector or a part 
10 thereof becomes integrated into the genome of the host cell. 

5.6. DRUG SCREENING ASSAYS 

The present invention provides a simple in vitro system 
for the screening of drug actions on both the constitutive 

15 and the inflammatory cyclooxygenase, which will be useful for 
the development of drugs that selectively inhibit 
inflammation without producing the side effects due to 
inhibition of constitutive prostaglandin production. Assays 
can be performed on living mammalian cells, which more 

20 closely approximate the effects of a particular serum level 
of drug in the body, or on microsomal extracts prepared from 
the cultured cell lines. Studies using microsomal extracts 
offer the possibility of a more rigorous determination of 
direct drug/enzyme interactions. 

25 The PGHS-2-synthesizing cell lines are useful for 

evaluating the activity of potential bioactive agents on the 
inflammatory cyclooxygenase, since the elevated levels of 
prostaglandins that are a primary hallmark of inflammation 
and account for much of the adverse effects of inflammation, 

30 result from increases in the level of PGHS-2, rather than in 
changes in constitutively expressed cyclooxygenase, PGHS-1. 

The present invention also provides a second mammalian 
cell line which contains a chromosomally integrated, 
recombinant DNA sequence, wherein said DNA sequence expresses 

35 mammalian, preferably human, PGHS-1, and wherein said DNA 
sequence does not express PGHS-2, and wherein said cell line 
also preferably does not express autologous PGHS-1 or PGHS-2 
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(Llewellyn et al., 1987 , J. Mol. Biol., 195:115-123; Hanahan 
et al. 1983, J. Mol. Biol., 166:557-580). Once inside the 
cell, the transfer vector may replicate, and be transcribed 
by cellular polymerases to produce the RNA or it may be 
5 integrated into the genome of the host cell. Alternatively, 
a transfer vector containing sequences encoding one or more 
of the RNAs may be transfected into cells or introduced into 
cells by way of micromanipulation techniques such as 
microinjection, such that the transfer vector or a part 
10 thereof becomes integrated into the genome of the host cell. 

5.6. DRUG SCREENING ASSAYS 

The present invention provides a simple in vitro system 
for the screening of drug actions on both the constitutive 

15 and the inflammatory cyclooxygenase, which will be useful for 
the development of drugs that selectively inhibit 
inflammation without producing the side effects due to 
inhibition of constitutive prostaglandin production. Assays 
can be performed on living mammalian cells, which more 

20 closely approximate the effects of a particular serum level 
of drug in the body, or on microsomal extracts prepared from 
the cultured cell lines. Studies using microsomal extracts 
offer the possibility of a more rigorous determination of 
direct drug/ enzyme interactions. 

25 The PGHS-2 -synthesizing cell lines are useful for 

evaluating the activity of potential bioactive agents on the 
inf lammatory cyclooxygenase, since the elevated levels of 
prostaglandins that are a primary hallmark of inflammation 
and account for much of the adverse effects of inflammation, 

30 result from increases in the level of PGHS-2, rather than in 
changes in constitutively expressed cyclooxygenase, PGHS-1. 

The present invention also provides a second mammalian 
cell line which contains a chromosomal ly integrated, 
recombinant DNA sequence, wherein said DNA sequence expresses 

35 mammalian, preferably human, PGHS-1, and wherein said DNA 
sequence does not express PGHS-2, and wherein said cell line 
also preferably does not express autologous PGHS-1 or PGHS-2 
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activity. This second cell line is also preferably a 
primate, murine or human cell line. 

Thus, the present invention also provides a method to 
evaluate the relative inhibitory activity of a compound to 
5 selectively inhibit PGHS-2 versus PGHS-1, and thus to 
specifically inhibit the elevated prostaglandin synthesis 
that occurs in inflamed mammalian tissues, preferably human 
tissues, or in other physiological or pathological conditions 
in a mammalian host, preferably a human host, in which the 
10 PGHS-2 is elevated and the constitutive PGHS-1 is not. This 
assay comprises contacting the present PGHS-2 -expressing 
transgenic ceil line or a microsomal extract thereof with a 
preselected amount of the compound in a suitable culture 
medium or buffer, adding arachidonic acid to the mixture, and 
15 measuring- the level of synthesis of a PGHS-mediated 

arachidonic acid metabolite, i.e. , thromboxane synthesis, 
prostaglandin synthesis, e.g., the synthesis of PGEj, or the 
synthesis of any other metabolite unique to the 
cyclooxygenase pathway, by said cell line, or said microsomal 
20 extract, as compared to a control cell line or portion of 
microsomal extract in the absence of said compound. The 
compound can be evaluated for its ability to selectively 
inhibit PGHS-1 or PGHS-2 by performing a second assay 
employing the above-described steps, but substituting the 
25 PGHS-l-expressing transgenic cell line for the PGHS-2- 
expressing cell line of the invention. 

More specifically, the present- invention provides a 
method of determining the ability of a compound to inhibit 
prostaglandin, synthesis catalyzed by PGHS-2 or PGHS-1 in 
30 mammalian cells comprising: 

(a) adding a first preselected amount of said compound 
to a first transgenic mammalian cell line in 
culture medium, which cell line contains a chromo- 
somally integrated, recombinant DNA sequence, 
35 wherein said DNA sequence expresses mammalian PGHS- 

2, and wherein said DNA sequence does not express 
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activity. This second cell line is also preferably a 
primate, murine or human cell line. 

Thus, the present invention also provides a method to 
evaluate the relative inhibitory activity of a compound to 
5 selectively inhibit PGHS-2 versus PGHS-1, and thus to 
specifically inhibit the elevated prostaglandin synthesis 
that occurs in inflamed mammalian tissues, preferably human 
tissues, or in other physiological or pathological conditions 
in a mammalian host, preferably a human host, in which the 
10 PGHS-2 is elevated and the constitutive PGHS-1 is not. This 
assay comprises contacting the present PGHS-2 -expressing 
transgenic cell line or a microsomal extract thereof with a 
preselected amount of the compound in a suitable culture 
medium or buffer, adding arachidonic acid to the mixture, and 
15 measuring- the level of synthesis of a PGHS-mediated 

arachidonic acid metabolite, i.e., thromboxane synthesis, 
prostaglandin synthesis, e.g., the synthesis of PGEj, or the 
synthesis of any other metabolite unique to the 
cyclooxygenase pathway, by said cell line, or said microsomal 
20 extract, as compared to a control cell line or portion of 
microsomal extract in the absence of said compound. The 
compound can be evaluated for its ability to selectively 
inhibit PGHS-1 or PGHS-2 by performing a second assay 
employing the above-described steps, but substituting the 
25 PGHS-l-expressing transgenic cell line for the PGHS-2- 
expressing cell line of the invention. 

More specifically, the present- invention provides a 
method of determining the ability of a compound to inhibit 
prostaglandin, synthesis catalyzed by PGHS-2 or PGHS-1 in 
30 mammalian cells comprising: 

(a) adding a first preselected amount of said compound 
to a first transgenic mammalian cell line in 
culture medium, which cell line contains a chromo- 
somally integrated, recombinant DNA sequence, 
35 wherein said DNA sequence expresses mammalian PGHS- 

2, and wherein said DNA sequence does not express 
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PGHS-1, and wherein said cell line does not express 
autologous PGHS-1 or PGHS-2 activity; 

(b) adding arachidonic acid to said culture medium; 

(c) measuring the level of a PGHS-mediated arachidonic 
5 acid metabolite synthesized by said first cell 

line; 

(d) comparing said level with the level of said meta- 
bolite synthesized by said first cell line in the 
absence of said compound; 

10 (e) adding a second preselected amount of said compound 

to a second transgenic mammalian cell line in 
culture medium, which cell line contains chrom- 
osomal ly integrated, recombinant DNA sequence, 
wherein said DNA sequence expresses mammalian PGHS- 

15 1, and wherein said DNA sequence does not express 

PGHS-2, and wherein said cell line does not express 
autologous PGHS-1 or PGHS-2 activity; 
(f ) adding arachidonic acid to said culture medium of 
step (e); 

20 (g) measuring the level of a PGHS-mediated arachidonic 

acid metabolite synthesized by said second cell 
line; and 

(h) comparing said level with the level of said meta- 
bolite synthesized by said second cell line in the 
25 absence of said compound. 

The invention also relates to methods for the 
identification of genes, termed "pathway genes", which are 
associated with the PGHS-2 gene product or with the 
biochemical pathways which extend therefrom. "Pathway gene", 
30 as used herein, refers to a gene whose gene product exhibits 
the ability to interact with the PGHS-2 gene product. 

Any method suitable for detecting protein-protein 
interactions may be employed for identifying pathway gene 
products by identifying interactions between gene products 
35 and the PGHS-2 gene product. Such known gene products may be 
cellular or extracellular proteins. Those gene products 
which interact with such known gene products represent 

- 43 - 



WO 96/40720 



PCT/US96/08311 



PGHS-1, and wherein said cell line does not express 
autologous PGHS-1 or PGHS-2 activity; 

(b) adding arachidonic acid to said culture medium; 

(c) measuring the level of a PGHS-mediated arachidonic 
5 acid metabolite synthesized by said first cell 

line; 

(d) comparing said level with the level of said meta- 
bolite synthesized by said first cell line in the 
absence of said compound; 

10 (e) adding a second preselected amount of said compound 

to a second transgenic mammalian cell line in 
culture medium, which cell line contains chrom- 
osomal ly integrated, recombinant DNA sequence, 
wherein said DNA sequence expresses mammalian PGHS- 

15 1, and wherein said DNA sequence does not express 

PGHS-2, and wherein said cell line does not express 
autologous PGHS-1 or PGHS-2 activity; 
(f ) adding arachidonic acid to said culture medium o£ 
step (e); 

20 (g) measuring the level of a PGHS-mediated arachidonic 

acid metabolite synthesized by said second cell 
line; and 

(h) comparing said level with the level of said meta- 
bolite synthesized by said second cell line in the 
25 absence of said compound. 

The invention also relates to methods for the 
identification of genes, termed "pathway genes", which are 
associated with the PGHS-2 gene product or with the 
biochemical pathways which extend therefrom. "Pathway gene", 
30 as used herein, refers to a gene whose gene product exhibits 
the ability to interact with the PGHS-2 gene product. 

Any method suitable for detecting protein-protein 
interactions may be employed for identifying pathway gene 
products by identifying interactions between gene products 
35 and the PGHS-2 gene product. Such known gene products may be 
cellular or extracellular proteins. Those gene products 
which interact with such known gene products represent 
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pathway gene products and the genes which encode them 
represent pathway genes. 

Among the traditional methods which may be employed are 
co-immunoprecipitation, crosslinking and co-purification 
5 through gradients or chromatographic columns. Utilizing 
procedures such as these allows for the identification of 
pathway gene products. Once identified, a pathway gene 
product may be used, in conjunction with standard techniques, 
to identify its corresponding pathway gene. For example, at 

10 least a portion of the amino acid sequence of the pathway 
gene product may be ascertained using techniques well known 
to those of skill in the art, such as via the Edman 
degradation technique (see, e.g., Creighton, 1983, Proteins: 
Structures and Molecular Principles, w.H. Freeman & Co. , 

15 N.Y., pp. 34-49). The amino acid sequence obtained may be 
used as a guide for the generation of oligonucleotide 
mixtures that can be used to screen for pathway gene 
sequences. Screening made be accomplished, for example by ' 
standard hybridization or PCR techniques. Techniques for the 

20 generation of oligonucleotide mixtures and screening are 
well-known. (See, e.g., Ausubel et al., eds., 1987-1993, 
Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc. New York, and PCR Protocols: A Guide to Methods and 
Applications, 1990, Innis, M. et al., eds. Academic Press, 

25 inc. , New York) . 

Additionally, methods may be employed which result in 
the simultaneous identification of pathway genes which encode 
the protein interacting with the PGHS-2 gene product. These 
methods include, for example, probing expression libraries 

30 with labeled protein known or suggested to be involved in 
cardiovascular disease, using this protein in a manner 
similar to the well known technique of antibody probing of 
Xgtll libraries • 

One such method which detects protein interactions in 

35 vivo, the two-hybrid system, is described in detail for 

illustration only and not by way of limitation. One version 
of this system has been described (Chien et al., 1991, Proc. 
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pathway gene products and the genes which encode them 
represent pathway genes. 

Among the traditional methods which may be employed are 
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5 through gradients or chromatographic columns. Utilizing 
procedures such as these allows for the identification of 
pathway gene products. Once identified , a pathway gene 
product may be used, in conjunction with standard techniques, 
to identify its corresponding pathway gene. For example, at 

10 least a portion of the amino acid sequence of the pathway 
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to those of skill in the art, such as via the Edman 
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mixtures that can be used to screen for pathway gene 
sequences. Screening made be accomplished, for example by" 
standard hybridization or PCR techniques. Techniques for the 
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well-known. (See, e.g., Ausubel et al., eds., 1987-1993, 
Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc. New York, and PCR Protocols: A Guide to Methods and 
Applications, 1990, Innis, M. et al., eds. Academic Press, 

25 inc. , New York) . 

Additionally, methods may be employed which result in 
the simultaneous identification of pathway genes which encode 
the protein interacting with the PGHS-2 gene product. These 
methods include, for example, probing expression libraries 

30 with labeled protein known or suggested to be involved in 
cardiovascular disease, using this protein in a manner 
similar to the well known technique of antibody probing of 
Xgtll libraries. 

One such method which detects protein interactions in 

35 vivo, the two-hybrid system, is described in detail for 

illustration only and not by way of limitation. One version 
of this system has been described (Chien et al., 1991, Proc. 
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Natl. Acad. Sci. USA, 88:9578-9582} and is commercially 
available from Clontech (Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: one consists of 
5 the DNA-binding domain of a transcription activator protein 
fused to a known protein, and the other consists of the 
activator protein 1 s activation domain fused to an unknown 
protein that is encoded by a cDNA which has been recombined 
into this plasmid as part of a cDNA library. The plasmids 

10 are transformed into a strain of the yeast Saccharomyces 
cerevisiae that contains a reporter gene (e.g., lacZ) whose 
regulatory region contains the activator's binding sites. 
Either hybrid protein alone cannot activate transcription of 
the reporter gene: the DNA-binding domain hybrid because it 

15 does not provide activation function and the activation 

domain hybrid because it cannot localize to the activator's 
binding sites. Interaction of the two proteins reconstitutes 
the functional activator protein and results in expression of 
the reporter gene, which is detected by an assay for the 

20 reporter gene product. 

The two-hybrid system or related methodology may be used 
to screen activation domain libraries for proteins that 
interact with the PGHS-2 gene product, herein also called the 
known "bait" gene protein. Total genomic or cDNA sequences 

25 may be fused to the DNA encoding an activation domain. Such 
a library and a plasmid encoding a hybrid of the bait gene 
protein fused to the DNA-binding domain may be cotransf ormed 
into a yeast reporter strain, and the resulting transformants 
may be screened for those that express the reporter gene. 

30 These colonies may be purified and the library plasmids 
responsible for reporter gene expression may be isolated. 
DNA sequencing may then be used to identify the proteins 
encoded by the library plasmids. 

For example, and not by way of limitation, the bait gene 

35 may be cloned into a vector such that it is translationally 
fused to the DNA encoding the DNA-binding domain of the GAL4 
. protein. 
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Natl. Acad. Sci. USA, 88:9578-9582) and is commercially 
available from Clontech (Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: one consists of 
5 the DNA-binding domain of a transcription activator protein 
fused to a known protein, and the other consists of the 
activator protein's activation domain fused to an unknown 
protein that is encoded by a cDNA which has been recombined 
into this plasmid as part of a cDNA library. The plasmids 

10 are transformed into a strain of the yeast Saccharomyces 
cerevlsiae that contains a reporter gene (e.g., lacZ) whose 
regulatory region contains the activator's binding sites. 
Either hybrid protein alone cannot activate transcription of 
the reporter gene: the DNA-binding domain hybrid because it 

15 does not provide activation function and the activation 

domain hybrid because it cannot localize to the activator's 
binding sites. Interaction of the two proteins reconstitutes 
the functional activator protein and results in expression of 
the reporter gene, which is detected by an assay for the 

20 reporter gene product. 

The two-hybrid system or related methodology may be used 
to screen activation domain libraries for proteins that 
interact with the PGHS-2 gene product, herein also called the 
known "bait" gene protein. Total genomic or cDNA sequences 

25 may be fused to the DNA encoding an activation domain. Such 
a library and a plasmid encoding a hybrid of the bait gene 
protein fused to the DNA-binding domain may be cotransformed 
into a yeast reporter strain, and the resulting transformants 
may be screened for those that express the reporter gene. 

30 These colonies may be purified and the library plasmids 
responsible for reporter gene expression may be isolated. 
DNA sequencing may then be used to identify the proteins 
encoded by the library plasmids. 

For example, and not by way of limitation, the bait gene 

35 may be cloned into a vector such that it is trans lationally 
fused to the DNA encoding the DNA-binding domain of the GAL 4 
protein. 
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A cDNA library of the cell line from which proteins that 
interact with bait gene are to be detected can be made using 
methods routinely practiced in the art. According to the 
particular system described herein , for example, the cDNA 
5 fragments may be inserted into a vector such that they are 
trans lationally fused to the activation domain of GAL4 . This 
library may be co- trans formed along with the bait gene-GAL4 
fusion plasmid into a yeast strain which contains a lacZ gene 
driven by a promoter which contains the GAL4 activation 

10 sequence. A cDNA encoded protein, fused to the GAL4 
activation domain, that interacts with bait gene will 
reconstitute an active GAL4 protein and thereby drive 
expression of the lacZ gene. Colonies which express lacZ may 
be detected by their blue color in the presence of X-gal. 

15 The cDNA may then be purified from these strains, and used to 
produce and isolate the bait gene-interacting protein using 
techniques routinely practiced in the art. 

Once a pathway gene has been identified and isolated, it 
may be further characterized as, for example, discussed 

20 herein. 

The proteins identified as products of pathway genes may 
be used to modulate PGHS-2 gene expression, as defined 
herein, or may themselves be targets for modulation to in 
turn modulate symptoms associated with PGHS-2 expression. 

25 

5.7. COMPOUNDS IDENTIFIED IN THE SCREENS 
The compounds identified in the screen will demonstrate 
the ability to selectively modulate the expression of PGHS-2. 
These compounds include but are not limited to nucleic acid 
30 encoding PGHS-2 and homologues, analogues, and deletions 
thereof, as well as antisense, ribozyme, triple helix, 
antibody, and polypeptide molecules and small inorganic 
molecules . 

35 5.8. PHARMACEUTICAL FORMULATIONS 

AND ROUTES OF ADMINISTRATION 

Any of the identified compounds can be administered to 
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25 
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an animal host, including a human patient, by itself , or in 
pharmaceutical compositions where it is mixed with suitable 
carriers or excipient(s) at doses therapeutically effective 
to treat or ameliorate a variety of disorders, including 
5 those characterized by insufficient, aberrant, or excessive 
PGHS-2 activity. A therapeutically effective dose further 
refers to that amount of the compound sufficient to result in 
amelioration of symptoms associated with such disorders. 
Techniques for formulation and administration of the 

10 compounds of the instant application may be found in 

"Remington's Pharmaceutical Sciences," Mack Publishing Co., 
Easton, PA, latest edition. 

A number of disorders in addition to inflammation have 
been characterized by insufficient, aberrant, or excessive 

15 PGHS-2 activity. In addition, several physiological states 
which may, from time to time be considered undesired, are 
also associated with PGHS-2 activity. By way of example, but 
not by way of limitation, such disorders and physiological 
states which may be treated with the compounds of the 

20 invention include but are not limited to neurologic disorders 
such as Alzheimer's disease, stroke, and acute head injury; 
colorectal carcinoma; ovulation; preterm labor; 
endometriosis; implantation; and pulmonary fibrosis. 

Pathological features of Alzheimer's Disease (AD) 

25 include neuritic amyloid plaques, neurofibrillary tangles, 
neuronal cell loss, loss of synapses, and marked gliosis. 
Because they are unique features of the disease, many 
investigators have focused on the etiology and effects of 
amyloid plaques and neurofibrillary tangles. However, the 

30 significant gains made in understanding these neuropathologic 
markers have provided few clues regarding treatment of AD. 
In contrast, recent findings suggest that the "inflammatory 
processes" associated with gliosis represent a potential 
target for therapeutic intervention in the disease. In 

35 particular, Joe Rogers and colleagues have presented both 
retrospective and prospective evidence that non-steroidal 
anti-inflammatory agents can significantly slow the progress 
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an animal host, including a human patient, by itself, or in 
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20 invention include but are not limited to neurologic disorders 
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Pathological features of Alzheimer's Disease (AD) 

25 include neuritic amyloid plaques, neurofibrillary tangles, 
neuronal cell loss, loss of synapses, and marked gliosis. 
Because they are unique features of the disease, many 
investigators have focused on the etiology and effects of 
amyloid plaques and neurofibrillary tangles. However, the 

30 significant gains made in understanding these neuropathology 
markers have provided few clues regarding treatment of AD. 
In contrast, recent findings suggest that the "inflammatory 
processes" associated with gliosis represent a potential 
target for therapeutic intervention in the disease. In 

35 particular, Joe Rogers and colleagues have presented both 
retrospective and prospective evidence that non-steroidal 
anti-inflammatory agents can significantly slow the progress 
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of AD (McGeer and Rogers, 1992, Neurology , 42:447-449; Rogers 
et al., 1993, Neurology, 43:1609-1611). Indeed, these 
results have prompted the initiation of anti- inflammatory 
therapy trials for AD. 
S Evidence for an "inflammatory component" to gliosis in 

AD includes increased expression of proinflammatory cytokines 
such as IL-ljS and TNFa (Griffin et al., 1989, Proc. Nat 1 !. 
Acad. Sci. USA, 88:7611-7615; Dickson et al., 1993, Glia, 
7:75-83; Lapchak and Araujo, 1993, Soc. Neurosci. Abstr., 

10 19:191) and the presence of activated complement components 
(McGeer et al., 1989, Neurosci. Let., 107: 341-346; Johnson 
et al., 1992, Neurobiol. Aging, 13:641-648; Walker and 
McGeer, 1992 Mol. Brain Res., 14:109-116). It should be 
noted that gliosis and the presence of proinflammatory 

15 cytokines- with the potential to activate PGHS-2 are not 

limited to AD. Rather, they are a feature of many insults to 
and disease of the central nervous system including (but not 
limited to) acute head injury, stroke, spinal cord injury, 
multiple sclerosis, HIV infection of the brain and other 

20 viral encephalopathies, and most neurodegenerative disorders 
(e.g. Huntington's disease, Parkinson's disease, and 
amyotrophic lateral sclerosis) . 

PGHS-2 is expressed in cultured murine and rat 
astrocytes, and is strongly up-regulated by treatment with 

25 proinflammatory cytokines including IL-ljS and TNFa (O'Banion 
et al., 1994, Soc. Neurosci. Abstr.). The induction of PGHS- 
2 is rapid with mRNA levels peaking at 2 h. Concomitant 
increases in prostaglandin production are also observed. The 
fact that induced cyclooxgenase activity is blocked by NS- 

30 398, a specific inhibitor of PGHS-2, confirms that induction 
of PGHS-2 is responsible for increased prostaglandin 
production in cytokine-treated astrocytes. As in other cell 
types, glucocorticoid hormones suppress the induction of 
PGHS-2 by IL-ljS. 

35 Other investigators have confirmed that PGHS-2 is 

expressed in the brain (Yamagata et al., 1993, Neuron, 
11:371-386). In these studies, the brains of rats subjected 
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of AD (McGeer and Rogers, 1992, Neurology, 42:447-449; Rogers 
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PGHS-2 by IL-10. 
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to electroconvulsive shock showed dramatic increases in the 
levels of PGHS-2 expression in neurons of the cerebral cortex 
and hippocampus. The authors further demonstrated that 
synaptic activation led to induction of PGHS-2 mRNA, 
5 suggesting that expression of this molecule plays a 

significant role (as yet undefined) in neuronal communication 
and/or function. In preliminary in situ hybridization 
studies it has been confirmed that PGHS-2 is expressed in 
human brain neurons (Chang et al., 1995, Soc. Neurosci. Ann. 

10 Mtg. San Diego, Submitted) . 

Similar to their proven therapeutic benefits in 
peripheral inflammation, it is proposed that the efficacy of 
nonsteroidal anti-inflammatory therapy in the treatment of AD 
is due to the inhibition of PGHS-2 activity in "inflamed" 

15 brain tissue. This therapeutic approach has the potential to 
benefit a multitude of neurological diseases and injuries 
with a prominent degree of glial activation. Development of 
selective inhibitors of human PGHS-2 which specifically 
target the central nervous system (i.e. that are designed to 

20 easily cross the blood-brain barrier and even accumulate in 
the brain) may prove much more efficacious than current 
NSAIDS for the treatment of AD and other neurologic 
disorders. 

Colorectal carcinoma is a leading cause. of death in 
25 westernized countries. Prostaglandins have been correlated 
with carcinogenesis in general and more specifically with 
colorectal cancer, Marnett, 1992, Cancer Research, 52:5575- 
5589. In several clinical trials, aspirin use was associated 
with decreased colon tumor growth and death, Thun et al., 
30 1991, N. Engl. J. Med., 325:1593-6; Kune, et al., 1988, 
Cancer Res., 48:439-404. Sulindac, another cyclooxygenase 
inhibitor, has been demonstrated to cause colon polyp 
regression in patients with familial polyposis, Waddell and 
Loughry, 1983, J. Surg. Oncol., 24:83-87. These NSAIDS are 
35 able to inhibit both PGHS-1 and -2. Discovery of the gene 
for PGHS-2 makes clarification of the relative contribution 
or role in colon cancer possible. PGHS-2 is an immediate 
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early gene suggesting its likely participation in regulating 
growth* The decreased tumor growth by aspirin is likely 
through action on PGHS-2. If PGHS-2 is directly implicated 
then specific inhibition of this enzyme may result in tumor 
5 suppression. Discovery of the PGHS-2 gene allows for further 
clarification of this contribution. Additionally, if 
inhibition is therapeutic then specific drugs that inhibit 
PGHS-2 can be obtained that would be ingested and directly 
act at the mucosal and have limited systemic absorption. In 

10 the case of familial polyposis, gene therapy may play an 
important therapeutic role. 

Ovulation has in a broad sense can be viewed as an 
inf laramatory process initiated by the LH surge during the 
menstrual cycle, Espey, 1980, Biol. Reprod, 22:73-106. 

15 NSAIDs have been shown to inhibit ovulation in a number of 
model systems, Espey, 1982, Prostaglandin, 23:329-335. By 
inhibiting prostaglandin formulation and interrupting the 
inflammatory response ovulation is halted. It has been 
demonstrated that PGHS-2 is specifically stimulated by LH in 

20 granulosa cells at the time of ovulation and likely the 
target of NSAIDs that results in inhibition of ovulation, 
Sirois and Richards, 1992, J. Biol. Chem. , 267:6382-6388. 
Knowing the gene sequence and protein product not only 
provides the ability to further study this process but 

25 provides a specific target for contraception. PGHS-2 

specific drugs would allow inhibition without effecting the 
prostaglandin production by PGHS-1 which is protective to GI 
mucosa as well as involved with kidney function and many 
other homeostatic mechanisms. 

30 Preterm labor is a significant clinical problem. 

Current available drugs (tocolytics) are able to postpone 
labor but often are not able to stop labor definitively. 
Prostaglandins play an important role in induction of labor 
although their exact contribution and mechanism are yet to be 

35 clearly defined, Kelly, 1994, Endocrine Reviews, 15(5) :684- 
706. With the discovery of PGHS-2 a better understanding of 
prostaglandin regulation in the fetus and uterus can be 
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understood. Current medications used for preterm labor 
(tocolytics) work by blocking Ca flux thereby interfering 
with myometrium contraction. Common tocolytics include 
magnesium sulfate, 0-adrenergic receptor agonists, calcium 
5 channel blockers and oxytocin antagonists. Indomethacin has 
also been used effectively but raises concern with premature 
closure of the ductus arteriosus of the fetus. Closer 
examination of PGHS-1 and PGHS-2 in these roles may provide 
opportunities for specific intervention. 
10 Recognition of preterm labor prior to cervical changes 

is difficult but also the point at which tocolytic agents are 
most effective. It is known that prostaglandins are 
intimately involved in myometrium contraction of normal 
labor, Williams Obstetrics, Cunningham, MacDonald, Gant, 
15 Leveno, and Gilstrap (eds) Williams Obstetrics 19th Ed. 

Appleton and Lange, Norwalk CT, 1993. It may be possible to 
evaluate increased PGHS-2 expression and true labor prior to 
cervical changes. If safe sampling of the site of expression 
can be done then PCR methods may be able to provide a timely 
20 answer to whether the painful uterine contractions are 
Braxton-Hicks or true labor. 

Dysmenorrhea and endometriosis are common, painful 
problematic conditions for women. It is well known that 
NSAIDs are extremely effective at treating dysmenorrhea and 
25 endometriosis pain by inhibiting prostaglandin production. 
It is highly likely that the hormones responsible for the 
cycle of dysmenorrhea and endometriosis also regulates PGHS-2 
expression. Inhibition at the protein or genetic level could 
enhance specific treatment for dysmenorrhea and 
30 endometriosis. 

Prostaglandin formation is also part of implantation. 
Manipulation of PGHS-2 expression may provide a means for 
induction of abortion. 

PGHS-2 may play an important role in the lung pathology 
35 of cystic fibrosis. It has been demonstrated that high-dose 
ibuprofen slows the progression of lung disease in this 
patient population, Konstan, et al., 1995, N. Engl. J. Med., 
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332 (13) :848-854. Lung disease results more from the 
inflammatory response than by the colonization of bacteria. 
Utilization of inhalers can directly deliver medication to 
the site of inflammation. This may provide a logical disease 
5 process to attempt anti-sense, ribozyme or triple helix gene 
therapy aimed at inhibiting PGHS-2 expression. 

Besides attempts to inhibit cell growth by inhibiting 
PGHS-2 there may be certain circumstances whereby growth 
stimulation is desired as in tissue repair. Determination of 

10 the tissue specific regulation of PGHS-2 (studies which 

require gene sequence information) may lead to the ability to 
specifically up regulate PGHS-2 in particular cell types 
(i.e. fibroblasts, neurons). Additionally genetic constructs 
which will only be activated in particular cell types because 

15 of promoter construction could be developed. 

Other options may include direct delivery of enzyme 
which has been produced and purified by genetic means using 
the cloned gene. Recombinant protein would also greatly 
facilitate investigation into the, distinctions between the 

20 enzymes (PGHS-1 and PGHS-2) and the byproducts they produce. 
Other isoforms may exist and may be cloned utilizing 
PGHS-2 sequence. 

The compounds of the invention may be designed or 
administered for tissue specificity. If the compound 
-25 comprises a nucleic acid molecule , including those comprising 
an expression vector, it may be linked to a regulatory 
sequence which is specific for the target tissue, such as the 
brain, skin, joints, bladder, kidney, liver, ovary, etc. by 
methods which are known in the art including those set forth 

30 in Hart, 1994, Ann. Oncol. , 5 Suppl 4: 59-65; Dahler et al., 
1994, Gene, 145: 305-310; DiMaio et al., 1994, Surgery, 
116:205-213; Weichselbaum et al., Cancer Res.-, 54 :4266-4269; 
Harris et al., 1994, Cancer, 74 (Suppl. 3) : 1021-1025; 
Rettinger et al., Proc. Nat'l. Acad. Sci. USA, 91:1460-1464; 

35 and Xu et al, Exp. Hematol., 22:223-230; Brigham et al., 

1994, Prog. Clin. Biol. Res., 388:361-365. The compounds of 
the invention may be targeted to specific sites of 
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inflammation by direct injection to those sites, such as 
joints, in the case of arthritis. Compounds designed for use 
in the central nervous system should be able to cross the 
blood brain barrier or be suitable for administration by 
5 localized injection. Similarly, compounds specific for the 
bladder can be directly injected therein. Compounds may also 
be designed for confinement in the gastrointestinal tract for 
use against disorders such as colorectal carcinoma. In 
addition, the compounds of the invention which remain within 

10 the vascular system may be useful in the treatment of 
vascular inflammation which might arise as a result of 
arteriosclerosis, balloon angioplasty, catheterization, 
myocardial infarction, vascular occlusion, and vascular 
surgery and which have already been associated with PGHS-2 by 

15 Pritchard et al., 1994, J. Biol. Chem. , 269, 8504-8509. Such 
compounds which remain within the bloodstream may be prepared 
by methods well known in the art including those described 
more fully in Mclntire, 1994, Annals Biomed. Engineering, 
22:2-13. 

20 

5.8.1. EFFECTIVE DOSAGE 
Pharmaceutical compositions suitable for use in the 
present invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve 

25 its intended purpose. More specifically, a therapeutically 
effective amount means an amount effective to prevent 
development of or to alleviate the existing symptoms of the 
subject being treated. Determination of the effective 
amounts is well within the capability of those skilled in the 

30 art, especially in light of the detailed disclosure provided 
herein. 

For any compound used in the method of the invention, 
the therapeutically effective dose can be estimated initially 
from cell culture assays. For example, a dose can be 
35 formulated in animal models to achieve a circulating 

concentration range that includes the IC50 (the dose where 
50% of the cells show the desired effects) as determined in 
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cell culture. Such information can be used to more 
accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount 
of the compound that results in amelioration of symptoms or a 
5 prolongation of survival in a patient. Toxicity and 

therapeutic efficacy of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or 
experimental animals, e.g., for determining the LD50 (the 
dose lethal to 50% of the population) and the ED50 (the dose 

10 therapeutically effective in 50% of the population) . The 
dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio 
between LD50 and ED50. Compounds which exhibit high 
therapeutic indices are preferred. The data obtained from 

15 these cell culture assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage 
of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little 
or no toxicity. The dosage may vary within this range 

20 depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual 
physician in view of the patient's condition. (See e.g. 
Fingl et al., 1975, in "The Pharmacological Basis of 

25 Therapeutics", Ch. 1 pi). Dosage amount and interval may be 
adjusted individually to provide plasma levels of the active 
moiety which are sufficient to maintain the desired effects. 

In cases of local administration or selective uptake, 
the effective local concentration of the drug may not be 

30 related to plasma concentration. 

The amount of composition administered will, of course, 
be dependent on the subject being treated, on the subjects 
weight, the severity of the affliction, the manner of 
administration and the judgment of the prescribing physician. 

35 

5.8.2. COMPOSITION AND FORMULATION 
The pharmaceutical compositions of the present invention 
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may be manufactured in a manner that is itself known, e.g., 
by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, 
entrapping or lyophilizing processes. 
5 Pharmaceutical compositions for use in accordance with 

the present invention thus may be formulated in conventional 
manner using one or more physiologically acceptable carriers 
comprising excipients and auxiliaries which facilitate 
processing of the active compounds into preparations which 

10 can be used pharmaceutical^ . Proper formulation is 
dependent upon the route of administration chosen. 

For injection, the agents of the invention may be 
formulated in aqueous solutions, preferably in 
physiologically compatible buffers such as Hanks f s solution, 

15 Ringer's solution, or physiological saline buffer. For 
transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art. 

For oral administration, the compounds can be formulated 

20 readily by combining the active compounds with 

pharmaceutically acceptable carriers well known in the art. 
Such carriers enable the compounds of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions and the like, for oral 

25 ingestion by a patient to be treated. Pharmaceutical 

preparations for oral use can be obtained solid excipient, 
optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable 

30 excipients are, in particular, fillers such as sugars, 

including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch; wheat 
starch, rice starch, potato starch, gelatin, gum tragacanth, 
methyl cellulose, hydroxypropylmethyl-cellulose, sodium 

35 carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP) . 
If desired, disintegrating agents may be added, such as the 
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preparations for oral use can be obtained solid excipient, 
optionally grinding a resulting mixture, and processing the 
mixture of granules, after adding suitable auxiliaries, if 
desired, to obtain tablets or dragee cores. Suitable 

30 excipients are, in particular, fillers such as sugars, 

including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch; wheat 
starch, rice starch, potato starch, gelatin, gum tragacanth, 
methyl cellulose, hydroxypropylmethyl-cellulose, sodium 

35 carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP) . 
If desired, disintegrating agents may be added, such as the 
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cross-linked polyvinyl pyrrolidone, agar, or alginic acid or 
a salt thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For 
this purpose, concentrated sugar solutions may be used, which 
5 may optionally contain gum arabic, talc, polyvinyl 
pyrrolidone, carbopol gel, polyethylene glycol, and/or 
titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. Dyestuffs or pigments may be 
added to the tablets or dragee coatings for identification or 
10 to characterize different combinations of active compound 
doses. 

Pharmaceutical preparations which can be used orally 
include push-fit capsules made of gelatin, as well as soft, 
sealed capsules made of gelatin and a plasticizer, such as 

15 glycerol or sorbitol. The push-fit capsules can contain the 
active ingredients in admixture with filler such as lactose, 
binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended 

20 in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in 
dosages suitable for such administration. 

For buccal administration, the compositions may take the 

25 form of tablets or lozenges formulated in conventional 
manner. 

For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 

30 packs or a nebulizer, with the use of a suitable propellant, 
e.gr. , dichlorodif luoromethane, trichlorof luoromethane, 
- dichlorotetraf luoroethane, carbon dioxide or other suitable 
gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 

35 amount. Capsules and cartridges of e.g. gelatin for use in 
an inhaler or insufflator may be formulated containing a 
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powder mix of the compound and a suitable powder base such as 
lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g., by bolus injection or 
5 continuous infusion. Formulations for injection may be 

presented in unit dosage form, e.g., in ampoules or in multi- 
dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 

10 formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical formulations for parenteral 
administration include aqueous solutions of the active 
compounds in water-soluble form. Additionally, suspiensions 

15 of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or 
liposomes. Aqueous injection suspensions may contain 

20 substances which increase the viscosity of the suspension, 
such as sodium carboxymethyl cellulose, sorbitol, or dextran. 
Optionally, the suspension may also contain suitable 
stabilizers or agents which increase the solubility of the 
compounds to allow for the preparation of highly concentrated 

25 solutions. 

Alternatively, the active ingredient may be in powder 
form for constitution with a suitable vehicle, e.g., sterile 
pyrogen-free water, before use. 

The compounds may also be formulated in rectal 
30 compositions such as suppositories or retention enemas, e.g., 
containing conventional suppository bases such as cocoa 
butter or other glycerides. 

In addition to the formulations described previously, 
the compounds may also be formulated as a depot preparation. 
35 Such long acting formulations may be administered by 

implantation (for example subcutaneously or intramuscularly) 
or by intramuscular injection. Thus, for example, the 
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compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins , or as sparingly 
soluble derivatives, for example, as a sparingly soluble 
5 salt. 

A pharmaceutical carrier for the hydrophobic compounds 
of the invention is a cosolvent system comprising benzyl 
alcohol, a nonpolar surfactant, a water-miscible organic 
polymer, and an aqueous phase. Naturally, the proportions of 
10 a co-solvent system may be varied considerably without 
destroying its solubility and toxicity characteristics. 
Furthermore, the identity of the co-solvent components may be 
varied. 

Alternatively, other delivery systems for hydrophobic 
15 pharmaceutical compounds may be employed. Liposomes and 
emulsions are well known examples of delivery vehicles or 
carriers for hydrophobic drugs. Certain organic solvents 
such as dimethylsulf oxide also may be employed, although 
usually at the cost of greater toxicity. Additionally, the 
20 compounds may be delivered using a sustained-release system, 
such as semipermeable matrices of solid hydrophobic polymers 
containing the therapeutic agent. Various of sustained- 
release materials have been established and are well known by 
those skilled in the art. Sustained-release capsules may, 
„25 depending on their chemical nature, release the compounds for 
a few weeks up to over 100 days. Depending on the chemical 
nature and the biological stability of the therapeutic 
reagent, additional strategies for protein stabilization may 
be employed. 

30 The pharmaceutical compositions also may comprise 

suitable solid or gel phase carriers or excipients. Examples 
of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, 
starches, cellulose derivatives, gelatin, and polymers such 

35 as polyethylene glycols. 

Many of the compounds of the invention may be provided 
as salts with pharmaceutically compatible counterions. 
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Pharmaceutical ly compatible salts may be formed with many 
acids, including but not limited to hydrochloric, sulfuric, 
acetic, lactic, tartaric, malic, succinic, etc. Salts tend 
to be more soluble in aqueous or other protonic solvents that 
5 are the corresponding free base forms. 

5.8.3. ROUTES OF ADMINISTRATION 
Suitable routes of administration may, for example, 

include oral, rectal, transmucosal , transdermal, or 
10 intestinal administration; parenteral delivery, including 

intramuscular, subcutaneous, intramedullary injections, as 

well as intrathecal, direct intraventricular, intravenous, 

intraperitoneal, intranasal, or intraocular injections. 

Alternately, one may administer the compound in a local 
15 rather than systemic manner, for example, via injection of 

the compound directly into an affected area, often in a depot 

or sustained release formulation. 

Furthermore, one may administer the drug in a targeted 

drug delivery system, for example, in a liposome coated with 
20 an antibody specific for affected cells. The liposomes will 

be targeted to and taken up selectively by the cells. 

5.8.4. PACKAGING 

The compositions may, if desired, be presented in a pack 
25 or dispenser device which may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by 
instructions for administration. Compositions comprising a 
30 compound of the invention formulated in a compatible 

pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labelled for treatment of an 
indicated condition. Suitable conditions indicated on the 
label may include treatment of a disease such as one 
35 characterized by insufficient, aberrant, or excessive PGHS-2 
activity. 
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6. EXAMPLE: ISOLATION, CLONING, AND 

SEQUENCING OF MURINE PGHS-2 

The subsections below describe the identification and 

characterization of the murine PGHS-2 gene and gene product. 

5 The data demonstrate that PGHS-2 encodes a functional 

prostaglandin H synthase which is distinct from the product 

of the PGHS-1 gene. In addition, it is shown that 

Dexamethasone specifically down-regulates PGHS-2 expression 

while having no effect on PGHS-1 expression. 

10 

6.1. MATERIALS AND METHODS 

6.1.1. CELLS AND CELL CULTURES 
C127 mouse fibroblasts were obtained from Peter Howley 
15 (NIH) and propagated in high glucose Dulbecco's modified 
Eagle's medium supplemented with 10% fetal bovine serum 
(HyClone Laboratories) without antibiotics. See, Lowy et 
al., 1978, J. Virol. , 26:291-298. Cultures were monitored, 
for mycoplasma contamination by Hoechst 33258 staining in 
20 accord with the procedure of Chen, 1977, Exp. Cell Res., 
104:255-262. 

Exponentially growing, subconfluent (60-80%) cell 
monolayers (35-mm plates) were labeled in Dulbecco's modified 
Eagle's medium without methionine (Life Technologies) plus 

25 200 /iCi/ml Tran 35 S-label (>1,000 Ci/mmol; ICN) for 15 or 30 
min. In some cases, fresh fetal calf serum (10%) was present 
during the labeling period. Monolayers were rinsed twice 
with ice-cold Dulbecco's modified Eagle's medium (DMEM) with 
methionine prior to lysis in 200 fil of A8 buffer (9.5 M urea, 

30 2% ( w /v) Nonidet P-40, 2% (w/v) ampholines (LKB, 1.6% pH 
range 5-8, 04.% pH range 3.5-10), 5% (w/v) 2- 
mercaptoethanol) . Incorporation of label into proteins was 
determined by trichloroacetic acid precipitation. 
Dexamethasone (Sigma) was freshly prepared in phosphate- 

35 buffered saline (PBS) (stock concentrations based on molar 
extinction coefficient of 1.5 X 10 4 liters/mol/cm at 250 nn) 
and added to 1 /iM. The calcium ionophore A23187 (Calbiochem) 
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was used at a concentration of 5 /xM from a 2.5 mM stock in 
ethanol. Cycloheximide (Sigma) was used at a concentration 
of 25 /iM from a 100 X stock in water. This level inhibited 
protein synthesis by >97% within 15 min. Control cultures 
5 received appropriate amounts of solvents. 

6.1.2. DETERMINATION OF C YCLO 0XYGENAS E 
ACTIVITY 

Cyclooxygenase activity was determined in the cultures 
10 by addition of media containing exogenous arachidonic acid 
substrate (30 for 15 min. at 37 °C) followed by conversion 
of the prostaglandin E 2 product to a methyl oximate form. 
This derivative was then quantitated by radioimmunoassay (kit 
from Amersham Corp . ) . 

15 

6.1.3. RNA PREPARATION 

Total RNA was isolated from 15-cm plates using 
guanidinium isothiocyanate lysis followed by centrifugatiori 
through a cesium chloride cushion, Chirgwin et al., 1979 , 
20 Biochemistry, 18:5294-5299. Poly (A) RNA was prepared by two 
passes through oligo(dT) -cellulose columns, as disclosed by 
Aviv et al., 1972, Proc. Nat'l. Acad. Sci. USA, 69:1408-1412. 
RNAs were quantitated by absorbance measurements at 260 nm. 

25 6.1.4. cDNA SYNTHESIS 

Fifty iiq of poly-A enriched RNA from C127 cells treated 
for 2.5 hr. with serum and cycloheximide (25 /xm) were 
fractionated on a 10-30% sucrose gradient in the presence of 
10 mM CH 3 HgOH as disclosed by J. Sambrook et al., cited above. 

30 Every other fraction was assayed for the presence of the 4kb 
mRNA (C^Banion, et al., 1991, J. Biol. Chem. , 266:23261-23267 
by Northern blot analysis using the 1.6 kb 5 1 end of the 
ovine PGHS cDNA (obtained from Oxford Biomedical Research , 
Inc.) labeled by random priming. RNA samples and molecular 

35 weight markers (3 /xg; Bethesda Research Laboratories RNA 
ladder) were subjected to f ormaldehyde-agarose gel 
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electrophoresis (J. Sambrook et al., Molecular Cloning, cited 
above at pages 7. 30-7 . 32) and then blotted to nylon membranes 
(Duralon, Stratagene) by overnight capillary transfer in 10 x 
SSC (1 X SSC is 0,15 M NaCl, 0.015 M sodium citrate). 
5 cDNAs were prepared from fractions enriched in the 4-kb 

mRNA by oligo(dT) priming (Gubler et al., 1988, Gene (Amst.), 
25:263 kit from Stratagene) and ligated into X-ZAP II (Short 
et al., 1988, Nucleic Acids Res., 16:7583-7600, Stratagene). 
Two hundred fifty thousand plaques were screened with the 

10 ovine PGHS probe under conditions of reduced stringency (30% 
formamide, hybridization temperature reduced to 42 °C, filters 
washed in 2 X SSC + 0.1% at 55°C) . Double-strand dideoxy 
termination sequencing of Exo III nested deletion subclones 
was carried out in both directions using T7 DNA polymerase. 

15 See Heinikoff, 1984, Gene, 28:351; Del Sal et al., 1989, Bio- 
Techniques , 7:514-520. 

6.1.5. IN VITRO TRANSCRIPTION, IN VITRO 
TRANSLATION, IMMUNOPRECIPITATION, 

AND PRIMER EXTENSION 

20 ~ 

One jig of cDNA in a Bluescript vector (Stratagene) was 
linearized at the 3 1 end with Xho I and transcribed with T3 
RNA polymerase in a reaction containing the capping reagent 
m^S'JpppfS^G (kit from Stratagene). After purification, 

25 one-fifth of the transcribed RNA and 2.5 /xg of poly-A RNA 
purified as described above, from cycloheximide and serum- 
treated C127 cells were translated in separate in vitro 
reactions containing 35 S-methionine as described by the 
manufacturer (Pr omega) except that the RNAs were preincubated 

30 with 3.5 mM CH 3 HgOH for 10 min at room temperature. Reactions 
were diluted in a modified RIPA buffer and precipitated with 
polyclonal anti-PGHS serum (Oxford Biomedical Research, Inc.) 
or first precleared by incubating for 30 min with 50 /i/lml 
protein A-Sepharose (Pharmacia LKB Biotechnology Inc. ; 50% 

35 (v/v)). 0.01 volume of antiserum or normal rabbit serum was 
added to the lysate and allowed to incubate for 2 hr at 4°C 
prior to precipitation with protein A-Sepharose. The 
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pelleted beads were washed four times with 
immunoprecipitation buffer and then resuspended in Laemmli 
lysis buffer for 30 min at room temperature. The 
immunoprecipitated products were resolved by standard 10% 
5 SDS-PAGE and visualized by f luorography . 

For primer extension analysis two of poly-A RNA from 
C127 cells treated for 2 hr with serum and cycloheximide was 
reverse-transcribed with M-MuLV reverse transcriptase (Life 
Technologies) as described by Baker et al., 1987, EMBO J. , 

10 6:1027-1035, using a 32 P-end-labeled oligonucleotide 

complementary to nucleotide (nt) 55-75 of the sequenced 4.1 
kb cDNA. Reaction products were electrophoresed on a 
standard sequencing gel in parallel with an 35 S-labeled 
dideoxy sequencing reaction of the cDNA in its Bluescript 

15 vector using the same primer. 

6.1.6. CDNA EXPRESSION AND PGE, DETERMINATION 
In order to determine whether the 4 . 1 kb mRNA encodes a 
protein with cyclooxygenase activity, the cDNA was inserted 

20 into an SV40 late promoter expression vector (SVL, (Breatnach 
et al., 1983, Nucleic Acid Res., 11:7119. As reported by 
DeWitt et al., 1990, J. Biol. Chem. , 265:5192-5198, COS cells 
have little or no autologous cyclooxygenase activity. 
Therefore, these cells were transfected with 2.5 or 5 /ig of 

25 either the vector alone or the vector containing the 4.1 kb 
cDNA. 

6.1,7. NORTHERN BLOT ANALYSIS 
Poly-A enriched RNAs (2.5 pq) from C127 cells were 
fractionated by f ormaldehyde-agarose gel electrophoresis and 

30 transferred to a membrane (Duralon, Stratagene) . 

Hybridization was carried out as previously described by 
O'Banion et al., 1991, J. Virol.> 65:3481-3488, iising the 5 1 
1.2 kb EcoRl fragment of the 4.1 kb cDNA labeled with M P by 
random priming as disclosed by Feinberg et al., 1983, Anal. 

35 Biochem., 132:6-13. The membrane was later rehybridized with 
a similarly labeled portion (1.6 kb 5 1 end) of the 2.8 kb 



- 63 - 



WO 96/40720 



PCT/US96/08311 



pelleted beads were washed four times with 
immunoprecipitation buffer and then resuspended in Laemmli 
lysis buffer for 30 min at room temperature. The 
immunoprecipitated products were resolved by standard 10% 

5 SDS-PAGE and visualized by f luorography . 

For primer extension analysis two /ig of poly-A RNA from 
C127 cells treated for 2 hr with serum and cycloheximide was 
reverse-transcribed with M-MuLV reverse transcriptase (Life 
Technologies) as described by Baker et al., 1987 , EMBO J. f 

0 6:1027-1035, using a 32 P-end-labeled oligonucleotide 

complementary to nucleotide (nt) 55-75 of the sequenced 4.1 
kb cDNA. Reaction products were electrophoresed on a 
standard sequencing gel in parallel with an 35 S-labeled 
dideoxy sequencing reaction of the cDNA in its Bluescript 

5 vector using the same primer. 

6.1.6. CDNA EXPRESSION AND PGE^ DETERMINATION 
In order to determine whether the 4.1 kb mRNA encodes "a 
protein with cyclooxygenase activity, the cDNA was inserted 

• into an SV40 late promoter expression vector (SVL, (Breatnach 
et al., 1983, Nucleic Acid Res., 11:7119. As reported by 
DeWitt et al., 1990, J. Biol. Chem., 265:5192-5198, COS cells 
have little or no autologous cyclooxygenase activity. 
Therefore, these cells were transfected with 2.5 or 5 /ig of 

5 either the vector alone or the vector containing the 4.1 kb 
cDNA. 

6.1.7. NORTHERN BLOT ANALYSIS 
Poly-A enriched RNAs (2.5 jug) from C127 cells were 
fractionated by forma ldehyde-agarose gel electrophoresis and 

J transferred to a membrane (Duralon, Stratagene) . 
Hybridization was carried out as previously described by 
O'Banion et al., 1991, J. Virol."* 65:3481-3488, Using the 5* 
1.2 kb EcoRl fragment of the 4.1 kb cDNA labeled with M P by 
random priming as disclosed by Feinberg et al., 1983, Anal. 

> Biochem., 132:6-13. The membrane was later rehybridized with 
a similarly labeled portion (1.6 kb 5 1 end) of the 2.8 kb 



WO 96/40720 



PCTVUS96/08311 



ovine PGHS cDNA (Oxford Biomedical Research, Inc.), and an 
end-labeled 40-mer complimentary to j3-tubulin (Oncor) . RNA 
molecular weight markers (Life Technologies) were visualized 
by ethidium bromide staining. A similar analysis was 
5 performed on total RNA (5 fxg/lane) isolated from human 

monocytes by the guanidinium-acid-phenol extraction method of 
Chomczynski et al., 1987, Anal. Biochem. , 162:156-159. 

6.1.8. EXPRESSIONS OF PGHS-2 IN HUMAN 
10 MONOCYTES 

Adherent human monocytes isolated from healthy donors as 
described by Roberts et al., 1978, J. Immunol, 121:1052-1058, 
were suspended in M199 medium without serum at 1 x 10 6 
cells/ml. One ml aliquots in 5 ml polypropylene tubes were 

15 incubated with loosened caps in 5% C0 2 at 37 °C with occasional 
shaking. To derive the autoradiograph shown in Figure 3A, 
monocytes were incubated for 4 hr in the presence or absence 
of dexamethasone (1 /zM; Sigma) prior to total RNA isolation 
by the procedure of P. Chomczynski et al., cited above. Five 

20 Mg rna was subjected to Northern blot analysis as described 
by O'Banion et al., 1991, J. Biol. Chem., 34:23261-23267 with 
the indicated probes labeled by random priming (kit from 
Boehringer-Mannheim) to a specific activity >l x 10 9 cpm//ig. 
To derive the autoradiograph shown in Figure 3B, monocytes 

25 were treated with dexamethasone (1 /xM) , IL-10 (10 half- 
maximal units, Collaborative Research), or both for the 
indicated times prior to RNA isolation. Cycloheximide (25 
jiM; Sigma) was added to one set of incubations 15 min prior 
to the addition of cytokine or hormone. 

30 

6.2. RESULTS 

6.2-1- IDENTIFICATION AND CHARACTERIZATION 
OF PGHS-2 



A directionally cloned cDNA library was constructed in 
lambda ZAP II from sucrose gradient fractions enriched in the 
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4 kb mRNA identified in O'Banion, et al., 1991, J. Biol. 
Chem., 35:23261-23267 and screened with a radiolabeled 
portion of the 2.8 kb PGHs cDNA under conditions of lowered 
stringency. Several positive plaques were isolated and 
5 analyzed. One, about 4.1 kb in length, was fully sequenced. 
This clone encodes a 70 kDa protein specifically precipitated 
by anticyclooxygenase serum, which migrates identically with 
the immunoprecipitated protein product from in vitro 
translated poly A-mRNA. Primer extension analysis, using a 

10 20-mer starting at nt 75 of the sequence, indicated that 
transcription starts 24 bases upstream of the cDNA clone. 
Comparison of the 4.1 kb sequence (Fig. 1) with that of the 
previously cloned 2.8 kb PGHS cDNA from mice (which is very 
similar to that cloned from sheep and human tissues) , 

15 revealed a single open reading frame with 64% amino acid 

identity to the protein encoded by the 2.8 kb PGHS cDNA. The 
deduced protein sequences are colinear except that the 4.1 kb 
cDNA has shorter amino-terminus and longer carboxy-terminus. 
The full sequence has been deposited in GenBank, accession 

20 number M88242. 

6.2.2. PGHS-2 CDNA EXPRESSION IN COS CELLS 
PRODUCED A FUNCTIONAL PROSTAGLANDIN 
H SYNTHASE 

25 Two-dimensional gel electrophoresis of 35 S-labeled 

proteins from transfected cells showed a protein doublet 
(72/74 kDa, pi 7.5) in the 4.1 kb cDNA-expressing cells that 
corresponds exactly to the immunoprecipitated cyclooxygenase 
protein doublet observed in C127 mouse fibroblasts whose 

30 synthesis is increased by growth factors and decreased by 
glucocorticoid hormones . 

Transfected cells were also assayed for cyqlooxygenase 
activity. COS cells expressing the 4.1 kb cDNA produced 
nearly two orders of magnitude more prostaglandin Ej than 

35 control cells {Table 2) . Furthermore, prostaglandin 

production increased with the amount of transfected DNA. 
These results unequivocally demonstrate that the 4.1 kb mRNA 
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enclodes an active cyclooxygenase which was designated 
"glucocorticoid-regulated inflammatory PGHS (griPGHS) . 

Table 2 

5 Expression of the 4*1 kb cDNA in COS cells leads to 
prostaglandin synthesis. Subconfluent COS A. 2 cells in 
duplicate 60 mm plates were transfected with the indicated 
amounts of expression vector alone (SVL) or the expression 
vector containing the 4.1 kb cDNA (SVL-4.1) and assayed for 
10 pgEjj production 2 days later. 



DNA Amount pg PGEj/jig protein 



None 






0.56, 


0.58, 0.51, 0.50 


SVL 


2.5 


Mg 


0.55, 


0.68 


SVL 


5.0 


Mg 


0.63, 


0.65 


SVL-4 . 1 


2.5 


Mg 


14.8, 


24.6 


SVL-4.1 


5.0 


Mg 


63.8, 


42.4 



20 

6.2.3. DEXAMETHASONE SPECIFICALLY 

REDUCES EXPRESSION OF PGHS-2 

AND NOT PGHS-1 IN HUMAN MONOCYTES 

Figures 3A-3B depicts Northern blots of total monocyte 
25 RNA and demonstrates that a 4.8-kb mRNA species is detected 
with the mouse griPGHS 4.1-kb probe. When normalized to the 
hybridization signal for jS-tubulin, griPGHS mRNA levels are 
down-regulated by dexamethasone at 4 hr (5-fold in this 
example), while the level of the 2.8-kb PGHS mRNA is not 
30 affected. In this experiment, the level of * accumulated PGEj 
in the supernatant after 4 hr of incubation was reduced by 
dexamethasone from 122.5 to 52.5 pg per 10 4 monpcytes. In 
another experiment, monocytes treated with IL-10 showed 
increased levels of griPGHS mRNA at 4 hr (2.5-fold relative 
35 to control) and 12 hr (14-fold) (Figures 3A-3B) . These 

increases were significantly blunted when dexamethasone was 
present. Furthermore, the IL-ljff induction and dexamethasone 
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repression of griPGHS mRNA abundance occurred in the presence 
of cycloheximide, where super induct ion of the 4.8-kb mRNA was 
clearly evident (Figures 3A-3B) . In contrast, levels of the 
2.8-kb mRNA were not significantly altered relative to 0- 
5 tubulin by IL-l/J, dexamethasone, or cycloheximide treatment. 

7. EXAMPLE: DRUG ASSAYS USING PGHS-2 TRANSFECTANTS 

The subsections below describe an assay employing the 
PGHS-2 transfectants of the previous example to determine a 
10 test compound's ability to modulate the effects of PGHS-2. 
It is shown that transformed cell lines stably produce 
prostaglandin. In addition, it is shown that several known 
drugs are potent inhibitors of PGHS-2 activity. 

15 7.1. MATERIALS AND METHODS 

7.1.1. EXPRESSION VECTOR CONSTRUCTION 
Following the methodology of Short et al., 1988, Nucleic 
Acids Res., 16:7583-7600, the 4.1 griPGHS cDNA clone was 

20 excised in vivo from the lambda ZAP II vector and the 

resulting griPGHS-Bluescript construct isolated on ampicillin 
plates. griPGHS was prepared for directional subcloning into 
the pRC/CMV expression vector (Invitrogen) by digestion with 
AccI, Klenow fill-in, and digestion with Not I. This 

25 fragment, extending from the Not I site 50 bases upstream of 
the cDNA end to nt 1947 of the cDNA, was isolated by gel 
electrophoresis and contains the full-coding region truncated 
immediately before any S'-AUUUA-S' mRNA destabilizing 
regions. The pRc/CMV vector DNA was digested with Xba I, 

30 filled in with Klenow, then digested with Not I. It was 
further prepared by calf intestinal alkaline phosphatase 
treatment. Ligated pRc/CMV-griPGHS recombinants were 
isolated from ampicillin plates following transformation into 
competent DH5a cells (Library Efficiency; Life Technologies), 

35 and were confirmed by restriction analysis of DNA mini- 
prepgs. The construct is illustrated in Figure 4. 
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7.1.2. TRANSFECTION AND ESTABLISHMENT 
OF STABLE CELL LINES 

Sixty-mm plates of subconfluent COS A2 cells, which 

contain little or no autologous cyclooxygenase activity, were 

5 transfected with 1 or 2.5 /ig of purified griP.GHS-pRC/CMV, or 

the vector alone, by lipofection for 23 hr following the 

manufacturer's directions (Life Technologies). After 2 days 

of growth in normal media (DMEM + 10% fetal bovine serum) , 

transfected cells were switched to media containing 800 fig/ml 

10 of Geneticin (G418, active component 657 /zg/ml; Life 

Technologies) , a concentration previously found to be toxic 
for COS cells. The media was changed every 3 days, and after 
2 weeks, many individual colonies were observed in the dishes 
transfected with either recombinant or vector alone,, but not 

15 in the dishes with no transfected DNA. A total of 36 griPGHS 
pRc/CMV- transfected and 12 vector-transf ected colonies were 
isolated using cloning cylinders • The majority of these 
survived continued selection in 800 iig/ml G418 during clonal 
line expansion. Established cultures are maintained in DMEM 

20 + 10% fetal bovine serum with 400 iig/ml G418. 

7.1.3. DRUG SCREENING STUDIES 
Prostaglandin assays were carried out as described 

above. For drug studies, cells were exposed to various 
25 concentrations of drugs for 30 min in serum-free DMEM and 

arachidonic acid was added directly from a 25x stock in DMEM. 

Supernatants were harvested 15 min later. Controls consisted 

of no drugs and wells treated with maximal concentrations of 

drug vehicles (1% methanol or ethanol) . Drugs were obtained 
30 from Sigma and prepared as 200 mM stock solutions 

(acetaminophen and ibuprofen in methanol, indomethacin in 

ethanol and naproxen in water) . 

7,2. RESULTS 

35 

7.2.1. EXPRESSION VECTORS 
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The pRC/CMV eukaryotic expression vector (Fig. 4) 
provides several distinct advantages. In addition to the 
ease of selection in both bacterial and eukaryotic hosts, 
expression of the present cloned cDNA is driven by a strong 
5 CMV promoter. The vector also provides a poly-A signal that 
is necessary since the present construct does not contain 
griPGHS 3 1 untranslated sequences (it ends 12 base pairs (bp) 
from the translation termination codon) . The removal of 
these sequences is important since in vivo they provide 
10 signals (5 1 -AUUUA-3 ■ ) for rapid mRNA degradation. Finally, 
the vector is well suited for use in COS cells which have 
little or no autologous cyclooxygenase activity. 

7.2.2. CELL LINE CHARACTERIZATION 
15 of the 36 griPGHS-pRc/CMV- and 12 vector alone-cloned 

neomycin resistant colonies, 29 and 9, respectively, were 
tested for PGE2 production. In all cases, vector-alone 
transfectants produced less than 8 /xg of PGE 2 per assay 
(number reflects the amount of PGE 2 secreted after 10 or 15 
20 min in 20 ^1 of collected media) , whereas the griPGHS 
transfected clones showed a wide range of prostaglandin 
production. Of these, eleven prostaglandin-producing and 2 
vector-alone containing clones were further expanded and 
r etested . 

25 The amount of PGE 2 secreted by the clones harboring the 

griPGHS construct varied form 10.6 to 72.2 pg/jxg cell protein 
(Table 3) . 

Table 3 

30 pge 3 production by various cell lines 

pg PGE 2 //xg cell ..protein 

4.4 
1.9 
16.7 
23.6 
46.8 
30.5 
34.2 



Cell Line 



A2 
A5 
El 

35 E-7 

E8 
E9 
Ell 
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transfectants produced less than 8 fig of PGEj per assay 
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20 min in 20 /xl of collected media) , whereas the griPGHS 
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production. Of these, eleven prostaglandin-producing and 2 
vector-alone containing clones were further expanded and 
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25 The amount of PGE 2 secreted by the clones harboring the 

griPGHS construct varied form 10.6 to 72.2 pg/fig cell protein 
(Table 3) . 
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F3 

F4 

F6 

F8 

F14 

F15 



40.0 
10.6 
12.2 
72.1 

3.5** 
16.8 



The values in column 2 represent the amount of 
prostaglandin secreted during a 10 min exposure to 30 j^M 
arachidonic acid and are normalized to total recovered 
cellular protein. Cell lines A2 and A5 contain the vector 
alone and the remaining cells were transfected with griPGHS- 
pRc/CMV. Note that only one (F14, marked by double asterisk, 
«**tt) showed no increase PGE 2 production over cells harboring 
the vector alone. 

Each of these lines was expanded for cryopreservation 
and one (E9) , chosen for ease of culturing and its 
significant PGEj production, was used in further studies. A 
sample of this cell line has been deposited in the American 
Type Culture Collection, Rockville, MD, U.S.A. under the 
provisions of the Budapest Treaty and assigned accession 
number ATCC 11119. 



7.2.3. STABILITY OF PGE ? PRODUCTION 
Stable expression of cyclooxygenase activity in the E9 
cell line was tested by comparing PGE 2 production over at 
least 5 passages of the cell line. After 6 weeks, these 
cells were still producing high levels of PGEj. Although the 
numbers are not directly comparable, since cell numbers were 
not normalized by protein determination in all cases, the 
amount of PGEj secreted by E9 cells in this standard assay 
ranged form 35 pg to 90 pg (per 20 jil assayed media) . 
Furthermore, within an experiment, E9 cells showed very 
consistent levels of PGE2 production from well to well. For 
example, for 12 tested supernatants , PGEj levels were 48.4 ± 
3.5 pg/20 fil (mean ± SEM) . 
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F3 

F4 

F6 

F8 

F14 

F15 



40,0 
10.6 
12*2 
72.1 

3.5** 
16.8 
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7.2.4. DRUG SCREENING STUDIES 
To illustrate the utility of the above described cell 
lines in drug testing, duplicate wells of the E9 cells were 
exposed to a range of doses (0.2 /iM - 2mM) of four non- 
5 steroidal anti- inflammatory drugs: acetaminophen, ibuprofen, 
naproxen, and indomethacin . Cells were placed in serum-free 
medium with the drugs for 30 min prior to a 15 min exposure 
to arachidonic acid (added directly to the media) . 
Synthesized PGEj was then quantitated from the supernatants by 

10 a standard radio immunoassay. Results, shown in Fig. 5, 
reveal specific dose-response curves for each drug with 
indomethacin showing the most and acetaminophen the least 
potency against griPGHS activity. The maximal inhibition in 
each case (except for acetaminophen where 2 mM was apparently 

15 not sufficient for full inhibition) was similar to that seen 
for COS cells harboring the vector alone (3-8 pg) . Low doses 
of each drug gave levels corresponding to the untreated 
control values which averaged at 48.4 pg. Note that controls 
run both with and without 1% drug vehicle (ethanol or 

20 ethanol; comparable to exposure in the 2mM drug conditions) 
showed no differences in PGE 2 production. 



8. EXAMPLE: PREPARATION OP MICROSOMAL 

EXTRACTS AND IN VITRO TESTING OF 
CYCLOOXYGENASE ACTIVITY 

The paragraphs below describe a method for determining 
cellular cyclooxygenase activity by preparing microsomal 
extracts of the cells to be tested and then testing the 
extracts for cyclooxygenase activity. In addition, it is 
shown that the effects of a test compound on cyclooxygenase 
activity can also be determined. 

Microsomal extracts and measurements of cellular 
cyclooxygenase activity are performed essentially as 
described by Raz et al., 1988, J. Biol. Chem. , 263:3022-3025; 
and Raz, et al., 1989, Proc. Nat'l. Acad. Sci. USA, 86:1657- 
1661. Cells are rinsed once with ice-cold PBS (pH=7.4) , 
scraped from dishes with a plastic disposable scraper (Life 
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Technologies) , transferred to 1.5 ml microfuge tubes with 
ice-cold PBS, and pelleted by centrifugation (8 minutes at 
800xg) . The supernatants are removed and the cell pellets 
rinsed with additional PBS. Cell pellets can be stored at - 
5 70°C at this point. 

To prepare extracts, the pellets are resuspended in 
solubilization buffer (50 mM Tris, ImM diethyldithiocarbamic 
acid (sodium salt), 10 mM EDTA, 1% (v/v Tween-20 and 0,2 
mg/ml a 2 -macroglobulin, pH-8.0), followed by sonication (5 x 

10 io sec bursts, low power setting). Extracts are clarified by 
centrifugation at 4°C (20 minutes at 16,000xg). Aliquots are 
taken for protein determination, and 50 /xl aliquots are 
diluted to 500 /xl with a solution containing 100 mM NaCl, 20 
mM sodium borate, 1.5 mM EDTA, 1,5 mM EGTA, 0.3 mM PMSF, 10 

15 mM NEM, 0.5% BSA, 0.5% Triton X-100, ImM epinephrine and ImM 
phenol (pH=9 . 0) . 

Reactions are initiated by the addition of arachidonic 
acid in the above buffer to 100 /iM of microsomal extract and 
incubated for 30 minutes at 37 °c. The PGEj formed is measured 

20 by RIA after quantitative conversion to the methyl oximated 
form as described by the RIA kit manufacturer (Amersham) . To 
test the effects of non-steroidal anti-inflammatory 
compounds, different dosages of drugs are added 5 min prior 
to initiating the reaction with arachidonic acid. 

25 

9. EXAMPLE: ISOLATION, CLONING AND SEQUENCING OP 
HUMAN PGHS-2 

The subsections below describe the identification and 
sequence of human PGHS-2. In addition, it is shown that 
30 transformed cell linies stably express PGHS-1 and PGHS-2. 

9.1. MATERIALS AND METHODS 

9.1.1. GENERATION OP HUMAN PGHS-1 
35 AND HUMAN PGHS-2 CDNA CLONES 

RNA was isolated from a human fibroblast cell line 

(W138) treated with serum and cycloheximide for 4 hr. Total 
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RNA isolation was done by guanidinium lysis followed by CsCl 
cushion centrifugation (Chirgwin et al., 1979* Biochem. , 
18:5294-5299. Polymerase chain reaction (PGR) primers 
specific for the human PGHS-1 and PGHS-2 sequences were 
5 engineered to amplify the coding regions of either one 
transcript or the other (Table 4). The 5« end primers 
contained a Hind III restriction site and the 3 1 end primers 
contained a Not I restriction site for subsequent cloning ♦ 
Reverse transcriptase polymerase chain reactions (RT-PCR) 
10 carried out as described by Kawasaski, 1990 , PGR Protocols: A 
Guide to Methods and Applications, M.A. Innis et al., eds., 
Academic Press, NY, using the specific primers generated PCR 
products about 2kb in size. 

15 Table 4 

PCR Primers 

A. Human PGHS-1 PCR Primers 

NotI 

20 5 1 -CTTACCCGAAGCTTGCGCCATGAGCCGG-3 1 (SEQ ID NO: 10) 

3 • -CGAGACTCCCCGTCGCCGGCGATTGCTT-5 ■ (SEQ ID NO: 11) 

Hindlll 

B. Human PGHS-2 PCR Primers 

NotI 

25 5 1 -TCATTCTAAGCTTCCGCTGCGATGCTCGC-3 1 (SEQ ID NO: 12) 

3 1 -GACATCTTCAGATTACGCCGGCGTACTAG-5 1 (SEQ ID NO: 13) 

Hindlll 



9.1.2. GENERATION OP PLASMID CONSTRUCTS 
FOR TRANSFECTION AND SEQUENCING 

Following purification and digestion with Hindlll and 
NotI, the two PCR products were each ligated into pRC/CMV 
vectors (Invitrogen) (see Figure 4). Ligated pRC/CMV-PGHS 
recombinant plasmids were isolated from ampicillin plates 
following transformation into competent DH5a cells (Life 
Technologies) . Clones were screened for the presence of PGHS 
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15 



inserts by restriction mapping. Three PGHS-2 clones were 
sequenced in both directions on an Applied Biosystems 
automated sequencer Model #373A. 

9.1.3. GENERATION OF STABLY TRANSFECTED 
MAMMALIAN CELL LINES 

Sixty-mm plates of 50% confluent COS-A2 (monkey-kidney) 

cells, which contain little or no cyclooxygenase activity 

were transfected with 1-2.5 /ig of purified pRC/CMV;hPGHS-2 

plasmid, pRC/CMV;hPGHS-l plasmid or the PRC/CMV vector alone 

by a calcium phosphate precipitation method (Chen et al., 

1987, Mol. Cell. Biol., 7:2745-2752. Plates were incubated 

at 35°C, 3% C0 2 for 24 hr in normal media (Dulbecco's modified 

Eagle Media (DMEM) + 10% fetal bovine serum) . After two 

rinses with warm DMEM, plates were transferred to 37 °C, 5% C0 2 

for an additional 24 hr. Selection was then started with 

normal media containing 800 tig/ml of Geneticin (active 

component G418, 657 fig/ml, Life Technologies), a 

concentration which is toxic for COS cells. The media was 

changed every 3 days and after 2 weeks, many individual 

colonies were observed in the dishes transfected with either 

recombinant PGHS vector or vector alone, but not in the 

dishes with no transfected DNA. Twelve to twenty-four 

colonies from each transfection were isolated using cloning 

cylinders. The majority of these survived continued G418 

selection during clonal cell-line expansion. Established 

cultures are maintained in DMEM + 10% fetal bovine serum with 

400 lig/nl G418. 

9.1.4. TESTING THE G418 RESISTANT 
30 CELL LINES AND CONFIRMING 

THE STABLE EXPRESSION OF PGHS-2 
AND PGHS-1 ACTIVITY 

Transfected COS cells plated in 12-well plates were 

grown to near confluence, rinsed twice with warm serum-free 

media and then covered with 300 pi of media containing 30 /xM 

35 

arachidonic acid (sodium salt; Sigma) . After 15 min, 
supernatants were placed in Eppendorf tubes on ice, clarified 
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by centrif ligation at 15,000 x g for 2 min, and assayed for 
PGE production by immunoassay alter conversion to the methyl 
oximated form (kit from Amersham) . 

Cell monolayers were solubilized in 0.5 M NaOH and 
5 neutralized with 1 M HC1 for protein concentration deter- 
minations using reagents from BioRad (modified Bradford 
Assay) . Cell lines expressing PGHS activity were further 
expanded and then frozen down in media with 10% DMS0. 

10 9.2. RESULTS 

9.2. 1. SEQUENCE OF HUMAN PGHS -2 

The clone comprising the PGHS-2 gene sequence depicted 
in Figures 6A-6B was selected for transf ection. This 

15 sequence differs from the human PGHS-2 sequence disclosed by 
Hla and Neilson, 1992 f Proc. Nat'l. Acad. Sci. USA, 89:7384- 
7388, due to a glutamic acid (E) rather than a glycine (w) at 
amino acid position 165 of the PGHS-2 gene product (Figure 
7) . The sequence for the PGHS-2 gene was confirmed by 

20 establishing the identity of the sequences of two other 
hPGHS-2 clones obtained from separate PCR runs, which 
demonstrates that the difference observed is not a PCR 
artifact. Furthermore, as shown in Figure 1, mouse PGHS-2 
also has a glutamic acid at this position. PGHS-1 clones 

25 were similarly screened and the sequences of the PGHS-1 gene 
and enzyme confirmed to be identical to that shown in Figure 
2 (SEQ ID NO: 6) in Yokahama and Tanabe, 1984 Biochem. 
Biophys. Res. Commun., 165:888-894; see also, Hla, 1986, 
Prostaglandins, 32:829 -8 45. 

30 

9.2.2. TRANSFORMED CELL LINES STABLY 
EXPRESSED PGHS-1 AND PGHS-2 

Cell line 4B4 expressing PGHS-2 and cell line H17A5 
expressing PGHS-1 were deposited on March 5, 1993 in the 
35 American Type Culture Collection, Rockville, MD, USA (cell 
line 4B4 was assigned ATCC accession number CRL 11284; cell 
line H17A5 was assigned ATCC CRL 11283) . 
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Levels of PGHS expression in the stably transformed cell 
lines varied and were much higher for PGHS-1 cell lines in 
comparison to PGHS-2 cell lines, as shown by the data in 
Table 5* 

5 Table S 

PGEj Production in Stably Transformed COS Cell Lines 



Human PGHS-1 Cell Lines Human PGHS-2 Cell Lines 



fDRC/CMV.-hPGHS-l) 


( P RC/CMV;hPGHS- 


-1) 


10 Line 


Level' 


Line 


Level* 


H17A1 


0.4 


2A2 


5.5 


H17A3 


2500 


2B1 


4.0 


H17A5* 


2500+ 


2B2 


37.5 


H17A6 


73.5 


2B3 


31.6 


H17B3 


145 


2B6 


29.0 


15 H22A2 


2036 


4A1 


36.2 


H22A5 


40.3 


4A2 


0.4 


H22B2 


73.5 


4A3 


0.6 


H22B3 


568 


4A4 


8.2 


H22B4 


9.2 


4A5 
4A6 
4B1 


9.8 
7.2 
24.6 


20 




4B2 


4.8 




4B3 

4B4* 

4B5 


13.1 
58.0 
10.6 



* Pg PGEj/15 min//xg cellular protein; COS-A2 = 0.4; COS-A2 
+ pRC/CMV vector = 0.4 



.25 

The cell lines have maintained high levels of PGHS 
expression even after many months of culturing. For example, 
the cell line 4B4 has been tested 6 times over 5 months and 
expression has ranged from 50-60 pg PG^/^g cellular protein. 
30 The exclusive presence of either PGHS-1 or PGHS-2 in the cell 
lines was confirmed by Northern analyses using hybridization 
probes that are specific for either PGHS-1 or P.GHS-2. 
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10. EXAMPLE: NONSTEROIDAL ANTI -INFLAMMATORY 

DRUG (NSAID) STUDIES ON STABLE HUMAN 
PGHS-1 AND PGHS-2 CELL LINES 

The text below describes the effects of various 
concentrations of NSAID on the ability of PGHS-1 and PGHS-2 
5 cell lines to produce prostaglandin. 

PGHS-1 and PGHS-2 cell lines (including 4B4 and H17A5) 
were exposed to various concentrations of NSAID for 30 min in 
serum- free DMEM. Arachidonic acid was added directly from a 
25x stock in DMEM and supernatant s were harvested 15 min 
10 later. Controls consisted of no drug treatment and cells 
treated with the maximal concentrations of drug vehicles (1% 
methanol or ethanol) . Drugs were obtained from Sigma Chem. 
Co. and prepared as 200 mM stock solutions (aspirin and 
ibuprofen in methanol, indomethacin in ethanol, and naproxen 
in water) . Cyclooxygenase activity was determined as 
described herein above. Distinctly different dose-response 
curves that were characteristic for either the PGHS-1 or 
PGHS-2 cell lines were observed. Particularly as shown in 
Figures 8A-8D and 9A-9D for indomethacin and aspirin, the 
20 levels of drug required for inhibition were different for the 
cells expressing PGHS-1 versus those expressing PGHS-2 
(Figures 8A-8D and 9A-9D) . 

All publications, patents and patent applications are 
herein incorporated by reference to the same extent as if 
25 each individual publication or patent application was 

specifically and individually indicated to be incorporated by 
reference. 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 

30 as single illustrations of individual aspects of the 

invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
modifications of the invention, in addition to those shown 
and described herein will become apparent to those skilled in 

35 the art from the foregoing description and accompanying 
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10. EXAMPLE : NONSTEROIDAL ANT I -INFLAMMATORY 

DRUG (NSAID) STUDIES ON STABLE HUMAN 
PGHS-1 AND PGHS-2 CELL LINES 

The text below describes the effects of various 
concentrations of NSAID on the ability of PGHS-1 and PGHS-2 
5 cell lines to produce prostaglandin. 

PGHS-1 and PGHS-2 cell lines (including 4B4 and H17A5) 
were exposed to various concentrations of NSAID for 30 min in 
serum- free DMEM. Arachidonic acid was added directly from a 
25x stock in DMEM and supernatant s were harvested 15 min 
10 later. Controls consisted of no drug treatment and cells 
treated with the maximal concentrations of drug vehicles (1% 
methanol or ethanol) . Drugs were obtained from Sigma Chem. 
Co. and prepared as 200 mM stock solutions (aspirin and 
ibuprofen in methanol, indomethacin in ethanol, and naproxen 
in water) . Cyclooxygenase activity was determined as 
described herein above. Distinctly different dose-response 
curves that were characteristic for either the PGHS-1 or 
PGHS-2 cell lines were observed. Particularly as shown in 
Figures 8A-8D and 9A-9D for indomethacin and aspirin, the 
levels of drug required for inhibition were different for the 
cells expressing PGHS-1 versus those expressing PGHS-2 
(Figures 8A-8D and 9A-9D) . 

All publications, patents and patent applications are 
herein incorporated by reference to the same extent as if 
each individual publication or patent application was 
specifically and individually indicated to be incorporated by 
reference. 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 

30 as single illustrations of individual aspects of the 

invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
modifications of the invention, in addition to those shown 
and described herein will become apparent to those skilled in 

35 the art from the foregoing description and accompanying 
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drawings. Such modifications are intended to fall within the 
scope of the appended claims. 

11. DEPOSIT OF MICROORGANISMS 
5 The following microorganisms have been deposited with 

the American Type Culture Collection, (ATCC) , Rockville, 
Maryland and have been assigned the following accession 
numbers : 
Micr oor gan i sm 

Q Strain Designation Date of Deposit Accession No. 

A1.2 p5 2/20/95 June 7, 1995 

hPGHS-2 A2.7 p6 11/3/93 June 7, 1995 
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drawings. Such modifications are intended to fall within the 
scope of the appended claims. 

11. DEPOSIT OF MICROORGANISMS 
5 The following microorganisms have been deposited with 

the American Type Culture Collection, (ATCC) , Rockville, 
Maryland and have been assigned the following accession 
numbers : 
Microorganism 

Q strain Designation Date of Deposit Accession No, 

A1.2 p5 2/20/95 June 7, 1995 

hPGHS-2 A2.7 p6 11/3/93 June 7, 1995 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Young , Donald A. 

O'Banion, Michael K. 
Winn, Virginia D. 

(ii) TITLE OF INVENTION: PRODUCTION OF MAMMALIAN PROSTAGLANDIN 
H SYNTHASE-2 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER*. IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/480,071 

(B) FILING DATE: 07-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE /DOCKET NUMBER: 3996-010 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741/8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 101.. 1912 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTTCAGGAGT CAGTJCAGGAC TCTGCTCACG AAGGAACTCA GCACTGCATC CTGCCAGCTC 

CACCGCCACC ACTACTGCCA CCTCCGCTGC CACCTCTGCG ATG CTC TTC CGA GCT 

Met Leu Phe Arg Ala 
1 5 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Young, Donald A. 

O'Banion, Michael K. 
Winn, Virginia D. 

(ii) TITLE OP INVENTION: PRODUCTION OP MAMMALIAN PROSTAGLANDIN 
H SYNTHASE-2 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER-:. IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/480,071 

(B) FILING DATE: 07-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE /DOCKET NUMBER: 3996-010 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741/8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 101.. 1912 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CTTCAGGAGT CAGIJCAGGAC TCTGCTCACG AAGGAACTCA GCACTGCATC CTGCCAGCTC 
CACCGCCACC ACTACTGCCA CCTCCGCTGC CACCTCTGCG ATG CTC TTC CGA GCT 

Met Leu Phe Arg Ala 
1 5 
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( GTG CTG CTC TGC GCT GCC CTG GGG CTC AGC CAG GCA GCA AAT CCT TGC 163 
Val Leu Leu Cys Ala Ala Leu Gly Leu Ser Gin Ala Ala Asn Pro Cys 
10 15 20 

TGT TCC AAT CCA TGT CAA AAC CGT GGG GAA TGT ATG AGC ACA GGA TTT 211 
Cys. Ser Asn Pro Cys Gin ABn Arg Gly Glu Cys Met Ser Thr Gly Phe 
25 30 35 

GAC CAG TAT AAG TGT GAC TGT ACC CGG ACT GGA TTC TAT GGT GAA AAC 259 
Asp Gin Tyr Lye Cys Asp Cys Thr Arg Thr Gly Phe Tyr Gly Glu Asn 
* 40 45 50 

TGT ACT ACA CCT GAA TTT CTG ACA AGA ATC AAA TTA CTG CTG AAG CCC 307 
Cys Thr Thr Pro Glu Phe Leu Thr Arg lie Lys Leu Leu Leu Lys Pro 
55 60 65 

ACC CCA AAC ACA GTG CAC TAC ATC CTG ACC CAC TTC AAG GGA GTC TGG 355 
Thr Pro Asn Thr Val His Tyr lie Leu Thr His Phe Lys Gly Val Trp 
70 75 80 85 

AAC ATT GTG AAC AAC ATC CCC TTC CTG CGA AGT TTA ATC ATG AAA TAT 403 
Asn He Val Asn Asn He Pro Phe Leu Arg Ser Leu He Met Lys Tyr 
90 95 100 

GTG CTG ACA TCC AGA TCA TAT TTG ATT GAC AGT CCA CCT ACT TAC AAT 451 
Val Leu Thr Ser Arg Ser Tyr Leu He Asp Ser Pro Pro Thr Tyr Asn 
105 HO 115 

GTG CAC TAT GGT TAC AAA AGC TGG GAA GCC TTC TCC AAC CTC TCC TAC 499 
Val His Tyr Gly Tyr Lys Ser Trp Glu Ala Phe Ser Asn Leu Ser Tyr 
120 125 130 

TAC ACC AGG GCC CTT CCT CCC GTA GCA GAT GAC TGC CCA ACT CCC ATG 547 
Tyr Thr Arg Ala Leu Pro Pro Val Ala Asp Asp Cys Pro Thr Pro Met 
135 140 145 

GGT GTG AAG GGA AAT AAG GAG CTT CCT GAT TCA AAA GAA GTG CTG GAA 595 
Gly Val Lys Gly Asn Lys Glu Leu Pro Asp Ser Lys Glu Val Leu Glu 
150 155 160 165 

AAG GTT CTT CTA CGG AGA GAG TTC ATC CCT GAC CCC CAA GGC TCA AAT 643 
Lys Val Leu Leu Arg Arg Glu Phe He Pro Asp Pro Gin Gly Ser Asn 
170 175 180 

ATG ATG TTT GCA TTC TTT GCC CAG CAC TTC ACC CAT CAG TTT TTC AAG 691 
Met Met Phe Ala Phe Phe Ala Gin His Phe Thr His Gin Phe Phe Lys 
185 190 195 

ACA GAT CAT AAG CGA GGA CCT GGG TTC ACC CGA GGA CTG GGC CAT GGA 739 
Thr Asp His Lys Arg Gly Pro Gly Phe Thr Arg Gly Leu Gly His Gly 
200 " 205 210 

GTG GAC TTA AAT CAC ATT TAT GGT GAA ACT CTG GAC AGA CAA CAT AAA 787 
Val Asp Leu Asn His He Tyr Gly Glu Thr Leu Asp Arg Gin His Lys 
215 220 225 

CTG CGC CTT TTC AAG GAT GGA AAA TTG AAA TAT CAG GTC ATT GGT- GGA 835 
Leu Arg Leu Phe Lys Asp Gly Lys Leu Lys Tyr Gin Val He Gly Gly 
230 235 240 245 

GAG GTG TAT CCC CCC ACA GTC AAA GAC ACT CAG GTA GAG ATG ATC TAC 883 
Glu Val Tyr Pro Pro Thr Val Lys Asp Thr Gin Val Glu Met He Tyr 
250 255 260 

CCT CCT CAC ATC CCT GAG AAC CTG CAG TTT GCT GTG GGG CAG GAA GTC 931 
Pro Pro His He Pro Glu Asn Leu Gin Phe Ala Val Gly Gin Glu Val 
265 270 275 
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GTG CTG CTC TGC GCT GCC CTG GGG CTC AGC CAG GCA GCA AAT CCT TGC 163 
Val Leu Leu Cys Ala Ala Leu Gly Leu Ser Gin Ala Ala Asn Pro Cys 
10 15 20 

TGT TCC AAT CCA TGT CAA AAC CGT GGG GAA TGT ATG AGC ACA GGA TTT 211 
Cys. Ser Asn Pro Cys Gin Asn Arg Gly Glu Cys Met Ser Thr Gly Phe 
25 30 35 

GAC CAG TAT AAG TGT GAC TGT ACC CGG ACT GGA TTC TAT GGT GAA AAC 259 
Asp Gin Tyr Lye Cys Asp Cys Thr Arg Thr Gly Phe Tyr Gly Glu Asn 
40 " 45 50 

TGT ACT ACA CCT GAA TTT CTG ACA AGA ATC AAA TTA CTG CTG AAG CCC 307 
Cys Thr Thr Pro Glu Phe Leu Thr Arg lie Lys Leu Leu Leu Lys Pro 
55 60 65 

ACC CCA AAC ACA GTG CAC TAC ATC CTG ACC CAC TTC AAG GGA GTC TGG 355 
Thr Pro Asn Thr Val His Tyr lie Leu Thr His Phe Lys Gly Val Trp 
70 75 80 85 

AAC ATT GTG AAC AAC ATC CCC TTC CTG CGA AGT TTA ATC ATG AAA TAT 403 
Asn lie Val Asn Asn He Pro Phe Leu Arg Ser Leu He Met Lys Tyr 
90 95 100 

GTG CTG ACA TCC AGA TCA TAT TTG ATT GAC AGT CCA CCT ACT TAC AAT 451 
Val Leu Thr Ser Arg Ser Tyr Leu He Asp Ser Pro Pro Thr Tyr Asn 
105 HO 115 

GTG CAC TAT GGT TAC AAA AGC TGG GAA GCC TTC TCC AAC CTC TCC TAC 499 
Val His Tyr Gly Tyr Lys Ser Trp Glu Ala Phe Ser Asn Leu Ser Tyr 
120 ~ 125 130 

TAC ACC AGG GCC CTT CCT CCC GTA GCA GAT GAC TGC CCA ACT CCC ATG 547 
Tyr Thr Arg Ala Leu Pro Pro Val Ala Asp Asp Cys Pro Thr Pro Met 
135 140 145 

GGT GTG AAG GGA AAT AAG GAG CTT CCT GAT TCA AAA GAA GTG CTG GAA 595 
Gly Val Lys Gly Asn Lys Glu Leu Pro Asp Ser Lys Glu Val Leu Glu 
150 155 160 165 

AAG GTT CTT CTA CGG AGA GAG TTC ATC CCT GAC CCC CAA GGC TCA AAT 643 
Lys Val Leu Leu Arg Arg Glu Phe He Pro Asp Pro Gin Gly Ser Asn 
170 175 180 

ATG ATG TTT GCA TTC TTT GCC CAG CAC TTC ACC CAT CAG TTT TTC AAG 691 
Met Met Phe Ala Phe Phe Ala Gin His Phe Thr His Gin Phe Phe Lys 
185 190 195 

ACA GAT CAT AAG CGA GGA CCT GGG TTC ACC CGA GGA CTG GGC CAT GGA 739 
Thr Asp His Lys Arg Gly Pro Gly Phe Thr Arg Gly Leu Gly HiB Gly 
200 205 210 

GTG GAC TTA AAT CAC ATT TAT GGT GAA ACT CTG GAC AGA CAA CAT AAA 787 
Val Asp Leu Asn His He Tyr Gly Glu Thr Leu Asp Arg Gin His Lys 
215 220 225 

CTG CGC CTT TTC AAG GAT GGA AAA TTG AAA TAT CAG GTC ATT GGT--- GGA 835 
Leu Arg Leu Phe Lys Asp Gly Lys Leu Lys Tyr Gin Val He Gly Gly 
230 235 240 245 

GAG GTG TAT CCC CCC ACA GTC AAA GAC ACT CAG GTA GAG ATG ATC TAC 883 
Glu Val Tyr Pro Pro Thr Val Lys Asp Thr Gin Val Glu Met He Tyr 
250 255 260 

CCT CCT CAC ATC CCT GAG AAC CTG CAG TTT GCT GTG GGG CAG GAA GTC 931 
Pro Pro His He Pro Glu Asn Leu Gin Phe Ala Val Gly Gin Glu Val 
265 270 275 
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TTT GGT CTG GTG CCT GGT CTG ATG ATG TAT GCC ACC ATC TGG CTT CGG 979 
Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala Thr He Trp Leu Arg 
280 285 290 

GAG CAC AAC AGA GTG TGC GAC ATA CTC AAG CAG GAG CAT CCT GAG TGG 1027 
Glu His Asn Arg Val Cys Asp He Leu Lys Gin Glu Hie Pro Glu Trp 
295 300 305 

GGT GAT GAG CAA CTA TTC CAA ACC AGC AGA CTC ATA CTC ATA GGA GAG 1075 
Gly Abo Glu Gin Leu Phe Gin Thr Ser Arg Leu He Leu He Gly Glu 
310 315 320 325 

ACT ATC AAG ATA GTG ATC GAA GAC TAC GTG CAA CAC CTG AGC GGT TAC 1123 
Thr He LyB He Val He Glu Asp Tyr Val Gin His Leu Ser Gly Tyr 
330 335 340 

CAC TTC AAA CTC AAG TTT GAC CCA GAG CTC CTT TTC AAC CAG CAG TTC 1171 
His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu Phe Asn Gin Gin Phe 
345 350 355 

CAG TAT CAG AAC CGC ATT GCC TCT GAA TTC AAC ACA CTC TAT CAC TGG 1219 
Gin Tyr Gin Asn Arg He Ala Ser Glu Phe Asn Thr Leu Tyr His Trp 
360 365 370 

CAC CCC CTG CTG CCC GAC ACC TTC AAC ATT GAA GAC CAG GAG TAC AGC 1267 
His Pro Leu Leu Pro Asp Thr Phe Asn He Glu Asp Gin Glu Tyr Ser 
375 380 385 

TTT AAA CAG TTT CTC TAC AAC AAC TCC ATC CTC CTG GAA CAT GGA CTC 1315 
Phe Lys Gin Phe Leu Tyr Asn Asn Ser He Leu Leu Glu His Gly Leu 
390 395 400 405 

ACT CAG TTT GTT GAG TCA TTC ACC AGA CAG ATT GCT GGC CGG GTT GCT 1363 
Thr Gin Phe Val Glu Ser Phe Thr Arg Gin lie Ala Gly Arg Val Ala 
410 415 420 

GGG GGA AGA AAT GTG CCA ATT GCT GTA CAA GCA GTG GCA AAG GCC TCC 1411 
Gly Gly Arg Asn Val Pro He Ala Val Gin Ala Val Ala LyB Ala Ser 
y 425 430 435 

ATT GAC CAG AGC AGA GAG ATG AAA TAC CAG TCT CTC AAT GAG TAC CGG 1459 
He Asp Gin Ser Arg Glu Met Lys Tyr Gin Ser Leu Asn Glu Tyr Arg 
* 440 445 450 

AAA CGC TTC TCC CTG AAG CCG TAC ACA TCA TTT GAA GAA CTT ACA GGA 1507 
Lys Arg Phe Ser Leu Lys Pro Tyr Thr Ser Phe Glu Glu Leu Thr Gly 
455 460 465 

GAG AAG GAA ATG GCT GCA GAA TTG AAA GCC CTC TAC AGT GAC ATC GAT 1555 
Glu Lys Glu Met Ala Ala Glu Leu Lys Ala Leu Tyr Ser Asp He Asp 
470 475 480 485 

GTC ATG GAA CTG TAC CCT GCC CTG CTG GTG GAA AAA CCT CGT CCA GAT 1603 
Val Met Glu Leu Tyr Pro Ala Leu Leu Val Glu Lys Pro Arg Pro Asp 
490 495 500 

GCT ATC TTT GGG GAG ACC ATG GTA GAG CTT GGA GCA CCA TTC TCC TTG 1651 
Ala He Phe Gly Glu Thr Met Val Glu Leu Gly Ala Pro Phe Ser Leu 
505 510 515 

AAA GGA CTT ATG GGA AAT CCC ATC TGT TCT CCT CAA TAC TGG AAG CCG 1699 
Lys Gly Leu Met Gly Asn Pro He Cys Ser Pro Gin Tyr Trp Lys Pro 
* 520 525 530 

AGC ACC TTT GGA GGC GAA GTG GGT TTT AAG ATC ATC AAT ACT GCC TCA 1747 
Ser Thr Phe Gly Gly Glu Val Gly Phe Lys He He Asn Thr Ala Ser 
535 540 545 
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TTT GGT CTG GTG CCT GGT CTG ATG ATG TAT GCC ACC ATC TGG CTT CGG 979 
Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala Thr lie Trp Leu Arg 
280 285 290 

GAG CAC AAC AGA GTG TGC GAC ATA CTC AAG CAG GAG CAT CCT GAG TGG 1027 
Glu His Asn Arg Val Cys Asp lie Leu Lye Gin Glu His Pro Glu Trp 
295 300 305 

GGT GAT GAG CAA CTA TTC CAA ACC AGC AGA CTC ATA CTC ATA GGA GAG 1075 
Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu lie Leu He Gly Glu 
310 315 320 325 

ACT ATC AAG ATA GTG ATC GAA GAC TAC GTG CAA CAC CTG AGC GGT TAC 1123 
Thr He Lys He Val He Glu Asp Tyr Val Gin His Leu Ser Gly Tyr 
330 335 340 

CAC TTC AAA CTC AAG TTT GAC CCA GAG CTC CTT TTC AAC CAG CAG TTC 1171 
His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu Phe Asn Gin Gin Phe 
345 350 355 

CAG TAT CAG AAC CGC ATT GCC TCT GAA TTC AAC ACA CTC TAT CAC TGG 1219 
Gin Tyr Gin Asn Arg He Ala Ser Glu Phe Asn Thr Leu Tyr His Trp 
360 " 365 370 

CAC CCC CTG CTG CCC GAC ACC TTC AAC ATT GAA GAC CAG GAG TAC AGC 1267 
His Pro Leu Leu Pro Asp Thr Phe Asn He Glu Asp Gin Glu Tyr Ser 
375 380 385 

TTT AAA CAG TTT CTC TAC AAC AAC TCC ATC CTC CTG GAA CAT GGA CTC 1315 
Phe Lva Gin Phe Leu Tyr Asn Asn Ser He Leu Leu Glu His Gly Leu 
390 395 400 405 

ACT CAG TTT GTT GAG TCA TTC ACC AGA CAG ATT GCT GGC CGG GTT GCT 1363 
Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He Ala Gly Arg Val Ala 
410 415 420 

GGG GGA AGA AAT GTG CCA ATT GCT GTA CAA GCA GTG GCA AAG GCC TCC 1411 
Gly Gly Arg Asn Val Pro He Ala Val Gin Ala Val Ala Lys Ala Ser 
425 430 435 

ATT GAC CAG AGC AGA GAG ATG AAA TAC CAG TCT CTC AAT GAG TAC CGG 1459 
He Asp Gin Ser Arg Glu Met Lys Tyr Gin Ser Leu Asn Glu Tyr Arg 
* 440 445 450 

AAA CGC TTC TCC CTG AAG CCG TAC ACA TCA TTT GAA GAA CTT ACA GGA 1507 
Lys Arg Phe Ser Leu Lys Pro Tyr Thr Ser Phe Glu Glu Leu Thr Gly 
* 455 460 465 

GAG AAG GAA ATG GCT GCA GAA TTG AAA GCC CTC TAC AGT GAC ATC GAT 1555 
Glu Lys Glu Met Ala Ala Glu Leu Lys Ala Leu Tyr Ser Asp He Asp 
470 475 480 485 

GTC ATG GAA CTG TAC CCT GCC CTG CTG GTG GAA AAA CCT CGT CCA GAT 1603 
Val Met Glu Leu Tyr Pro Ala Leu Leu Val Glu Lys Pro Arg Pro Asp 
490 495 500 

GCT ATC TTT GGG GAG ACC ATG GTA GAG CTT GGA GCA CCA TTC TCC TTG 1651 
Ala He Phe Gly Glu Thr Met Val Glu Leu Gly Ala Pro Phe Ser Leu 
505 510 515 

AAA GGA CTT ATG GGA AAT CCC ATC TGT TCT CCT CAA TAC TGG AAG CCG 1699 
Lys Gly Leu Met Gly Asn Pro He Cys Ser Pro Gin Tyr Trp Lys Pro 
520 525 530 

AGC ACC TTT GGA GGC GAA GTG GGT TTT AAG ATC ATC AAT ACT GCC TCA 1747 
Ser 5hr Phe Tly Gly Glu Val Gly Phe Lys He lie Asn Thr Ala Ser 
535 ~ 540 545 
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ATT CAG TCT CTC ATC TGC AAT AAT GTG AAG GGG TGT CCC TTC ACT TCT 1795 
lie Gin Ser Leu lie Cys Asn Asn Val Lys Gly Cys Pro Phe Thr Ser 
550 555 560 565 

TTC AAT GTG CAA GAT CCA CAG CCT ACC AAA ACA GCC ACC ATC AAT GCA 1843 
Phe Aen Val Gin Asp Pro Gin Pro Thr Lys Thr Ala Thr lie Asn Ala 
570 575 580 

AGT GCC TCC CAC TCC AGA CTA GAT GAC ATT AAC CCT ACA GTA CTA ATC 1891 
Ser Ala Ser His Ser Arg Leu Asp Asp He Asn Pro Thr Val Leu He 
585 " 590 595 

AAA AGG CGT TCA ACT GAG CTG TAAAAGTC 1920 
Lys Arg Arg Ser Thr Glu Leu 
600 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Phe Arg Ala Val Leu Leu Cys Ala Ala Leu Gly Leu Ser Gin 
15 10 15 

Ala Ala Asn Pro Cys Cys Ser Asn Pro Cys Gin Asn Arg Gly Glu Cys 
20 25 30 

Met Ser Thr Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 40 45 

Phe Tyr Gly Glu Asn Cys Thr Thr Pro Glu Phe Leu Thr Arg He Lys 
50 55 60 

Leu Leu Leu Lys Pro Thr Pro Asn Thr Val His Tyr He Leu Thr His 
65 70 75 80 

Phe Lys Gly Val Trp Asn He Val Asn Asn He Pro Phe Leu Arg Ser 
85 90 95 

Leu He Met Lys Tyr Val Leu Thr Ser Arg Ser Tyr Leu He Asp Ser 
100 105 HO 

Pro Pro Thr Tyr Asn Val His Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser Asn Leu Ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Ala Asp Asp 
130 * 135 140 

Cys Pro Thr Pro Met Gly Val Lys Gly ABn Lys Giu Leu Pro Asp Ser 
145 150 155 160 

Lvs Glu Val Leu Glu Lys Val Leu Leu Arg Arg Glu Phe He Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin HiB Phe Thr 
180 185 190 

His Gin Phe Phe Lys Thr Asp His Lys Arg Gly Pro Gly Phe Thr Arg 
195 200 205 
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ATT CAG TCT CTC ATC TGC AAT AAT GTG AAG GGG TGT CCC TTC ACT TCT 1795 
lie Gin Ser Leu lie Cys Asn Asn Val Lys Gly Cys Pro Phe Thr Ser 
550 555 560 565 

TTC AAT GTG CAA GAT CCA CAG CCT ACC AAA ACA GCC ACC ATC AAT GCA 1843 
Phe Aan Val Gin Asp Pro Gin Pro Thr Lys Thr Ala Thr lie Asn Ala 
570 575 580 

AGT GCC TCC CAC TCC AGA CTA GAT GAC ATT AAC CCT ACA GTA CTA ATC 1891 
Ser Ala Ser His Ser Arg Leu Asp Asp lie Asn Pro Thr Val Leu lie 
585 " 590 595 

AAA AGG CGT TCA ACT GAG CTG TAAAAGTC 1920 
Lys Arg Arg Ser Thr Glu Leu 
600 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Phe Arg Ala Val Leu Leu Cys Ala Ala Leu Gly Leu Ser Gin 
1 5 10 15 

Ala Ala Asn Pro Cys Cys Ser Asn Pro Cys Gin Asn Arg Gly Glu Cys 
20 25 30 

Met Ser Thr Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 40 45 

Phe Tyr Gly Glu Asn Cys Thr Thr Pro Glu Phe Leu Thr Arg lie Lys 
50 55 60 

Leu Leu Leu Lys Pro Thr Pro Asn Thr Val His Tyr lie Leu Thr His 
65 70 75 80 

Phe Lys Gly Val Trp Asn lie Val Asn Asn lie Pro Phe Leu Arg Ser 
85 90 95 

Leu lie Met Lys Tyr Val Leu Thr Ser Arg Ser Tyr Leu lie Asp Ser 
100 105 110 

Pro Pro Thr Tyr Asn Val His Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser Asn Leu Ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Ala Asp Asp 
130 " 135 140 

Cys Pro Thr Pro Met Gly Val Lys Gly Asn Lys Giu Leu Pro Asp Ser 
145 150 155 160 

Lys Glu Val Leu Glu Lys Val Leu Leu Arg Arg Glu Phe lie Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin His Phe Thr 
180 185 190 

His Gin Phe Phe Lys Thr Asp His Lys Arg Gly Pro Gly Phe Thr Arg 
195 200 205 
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Gly Leu Gly His Gly Val Asp Leu Asn His lie Tyr Gly Glu Thr Leu 
210 "* 215 220 

Asp Arg Gin His Lys Leu Arg Leu Phe Lys Asp Gly Lys Leu Lys Tyr 
225 230 235 240 

Gin Val lie Gly Gly Glu Val Tyr Pro Pro Thr Val Lys Asp Thr Gin 
245 250 255 

Val Glu Met lie Tyr Pro Pro His He Pro Glu Asn Leu Gin Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr He Trp Leu Arg Glu His Asn Arg Val Cys Asp He Leu Lys Gin 
290 295 300 

Glu His Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

He Leu He Gly Glu Thr He Lys He Val He Glu Asp Tyr Val Gin 
325 330 335 

His Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Gin Gin Phe Gin Tyr Gin Asn Arg He Ala Ser Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Asp Thr Phe Asn He Glu 
370 375 380 

Asp Gin Glu Tyr Ser Phe Lys Gin Phe Leu Tyr Asn Asn Ser He Leu 
385 * 390 395 400 

Leu Glu His Gly Leu Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg Asn Val Pro He Ala Val Gin Ala 
420 ~ 425 430 

Val Ala Lys Ala Ser He Asp Gin Ser Arg Glu Met Lys Tyr Gin Ser 
435 440 445 

Leu Asn Glu Tyr Arg Lys Arg Phe Ser Leu Lys Pro Tyr Thr Ser Phe 
450 * 455 460 

Glu Glu Leu Thr Gly Glu Lys Glu Met Ala Ala Glu Leu Lys Ala Leu 
465 470 475 480 

Tyr Ser Asp He Asp Val Met Glu Leu Tyr Pro Ala Leu Leu Val Glu 
485 490 495 

Lys Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Leu Gly 
500 505 510 

Ala Pro Phe Ser Leu Lys Gly Leu Met Gly Asn Pro He Cys Ser Pro 
515 520 525 

Gin Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Lys He 
530 535 540 

He Asn Thr Ala Ser lie Gin Ser Leu He Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Asn Val Gin Asp Pro Gin Pro Thr Lys Thr 
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Gly Leu Gly Hie Gly Val Asp Leu Aan His He Tyr Gly Glu Thr Leu 
210 215 220 

Asp Arg Gin His Lys Leu Arg Leu Phe Lys Asp Gly Lys Leu Lys Tyr 
225 230 235 240 

Gin Val He Gly Gly Glu Val Tyr Pro Pro Thr Val Lys Asp Thr Gin 
245 250 255 

Val Glu Met He Tyr Pro Pro His He Pro Glu Asn Leu Gin Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr He Trp Leu Arg Glu His Asn Arg Val Cys Asp He Leu Lys Gin 
290 ~ 295 300 

Glu His Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

He Leu He Gly Glu Thr He Lys He Val He Glu Asp Tyr Val Gin 
325 330 335 

His Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Gin Gin Phe Gin Tyr Gin Asn Arg He Ala Ser Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Asp Thr Phe Asn He Glu 
370 375 380 

Asp Gin Glu Tyr Ser Phe Lys Gin Phe Leu Tyr Asn Asn Ser He Leu 
385 390 395 400 

Leu Glu His Gly Leu Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg Asn Val Pro He Ala Val Gin Ala 
420 " 425 430 

Val Ala Lys Ala Ser He Asp Gin Ser Arg Glu Met Lys Tyr Gin Ser 
435 440 445 

Leu Asn Glu Tyr Arg Lys Arg Phe Ser Leu Lys Pro Tyr Thr Ser Phe 
450 455 460 

Glu Glu Leu Thr Gly Glu Lys Glu Met Ala Ala Glu Leu Lys Ala Leu 
465 470 475 480 

Tyr Ser Asp He Asp Val Met Glu Leu Tyr Pro Ala Leu Leu Val Glu 
1 485 490 495 

Lys Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Leu Gly 
500 505 510 

Ala Pro Phe Ser Leu Lys Gly Leu Met Gly Asn Pro He Cys Ser Pro 
515 520 525 

Gin Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Lys He 
530 535 540 

He Asn Thr Ala Ser lie Gin Ser Leu He Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Asn Val Gin Asp Pro Gin Pro Thr Lys Thr 
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565 570 575 

Ala Thr lie Aon Ala Ser Ala Ser His Ser Arg Leu Asp Asp He Asn 
580 585 590 

Pro Thr Val Leu He Lys Arg Arg Ser Thr Glu Leu . 
595 600 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCGCTGCGAT 


GCTCGCCCGC 


GCCCTGCTGC 


TGTGCGCGGT 


CCTGGCGCTC 


AGCCATACAG 


60 


CAAATCCTTG 


CTGTTCCCAC 


CCATGTCAAA 


ACCCAGGTGT 


ATGTATGAGT 


GTGGGATTTG 


120 


ACCAGTATAA 


GTGCGATTGT 


ACCCGGACAG 


GATTCTATGG 


AGAAAACTGC 


TCAACACCGG 


180 


AATTTTTGAC 


AAGAATAAAA 


TTATTTCTGA 


AACCCACTCC 


AAACACAGTG 


CACTACATAC 


240 


TTACCCACTT 


CAAGGGATTT 


TGGAACGTTG 


TGAATAACAT 


TCCCTTCCTT 


CGAAATGCAA 


300 


TTATGAGTTA 


TGTGTTGACA 


TCCAGATCAC 


ATTTGATTGA 


CAGTCCACCA 


At 1 1 ALAA JL\j 


joU 


CTGACTATGG 


CTACAAAAGC 


TGGGAAGCCT 


TCTCCAACCT 


CTCCTATTAT 


ACTAGAGCCC 


420 


TTCCTCCTGT 


GCCTGATGAT 


TGCCCGACTC 


CCTTGGGTGT 


CAAAGGTAAA 


AAGCAGCTTC 


480 


CTGATTCAAA 


TGAGATTGTG 


GAAAAATTGC 


TTCTAAGAAG 


AAAGTTCATC 


CCTGATCCCC 


540 


AGGGCTCAAA 


CATGATGTTT 


GCATTCTTTG 


CCCAGCACTT 


CACGCATCAG 


TTTTTCAAGA 


600 


CAGATCATAA 


GCGAGGGCCA 


GCTTTCACCA 


ACGGGCTGGG 


CCATGGGGTG 


GACTTAAATC 


660 


ATATTTACGG 


TGAAACTCTG 


GCTAGACAGC 


GTAAACTGCG 


CCTTTTCAAG 


GATGGAAAAA • 


720 


TGAAATATCA 


GATAATTGAT 


GGAGAGATGT 


ATCCTCCCAC 


AGTCAAAGAT 


ACTCAGGCAG 


780 


AGATGATCTA 


CCCTCCTCAA 


GTCCCTGAGC 


ATCTACGGTT 


TGCTGTGGGG 


CAGGAGGTCT 


840 


TTGGTCTGGT 


GCCTGGTCTG 


ATGATGTATG 


CCACAATCTG 


GCTGCGGGAA 


CACAACAGAG 


900 


TATGCGATGT 


GCTTAAACAG 


GAGCATCCTG 


AATGGGGTGA 


TGAGCAGTTG 


TTCCAGACAA 


960 


GCAGGCTAAT 


ACTGATAGGA 


GAGACTATTA 


AGATTGTGAT 


TGAAGATTAT 


GTGCAACACT 


1020 


TGAGTGGCTA 


TCACTTCAAA 


CTGAAGTTTG 


ACCCAGAACT 


ACTTTTCAAC 


AAACAGTTCC 


1080 


AGTACCAAAA 


TCGTATTGCT 


GCTGAATTTA 


ACACCCTCTA 


TCACTGGCAT 


CCCCTTCTGC 


1140 


CTGACACCTT 


TCAAATTCAT 


GACCAGAAAT 


ACAACTATCA ACAGTTTATC 


TACAACAACT 


1200 


CTATATTGCT 


GGAACATGGA ATTACCCAGT 


TTGTTGAATC 


ATTCACCAGG 


CAGATTGCTG 


1260 


GCAGGGTTGC 


TGGTGGTAGG 


AATGTTCCAC 


. CCGCAGTACA GAAAGTATCA 


CAGGCTTCCA 


1320 
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Ala Thr He Asn Ala Ser Ala Ser His Ser Arg Leu Asp Asp He Asn 
580 585 590 

Pro Thr Val Leu He Lys Arg Arg Ser Thr Glu Leu . 
595 600 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGCTGCGAT GCTCGCCCGC GCCCTGCTGC TGTGCGCGGT CCTGGCGCTC AGCCATACAG 60 

CAAATCCTTG CTGTTCCCAC CCATGTCAAA ACCCAGGTGT ATGTATGAGT GTGGGATTTG 120 

ACCAGTATAA GTGCGATTGT ACCCGGACAG GATTCTATGG AGAAAACTGC TCAACACCGG 180 

AATTTTTGAC AAGAATAAAA TTATTTCTGA AACCCACTCC AAACACAGTG CACTACATAC 240 

TTACCCACTT CAAGGGATTT TGGAACGTTG TGAATAACAT TCCCTTCCTT CGAAATGCAA 300 

TTATGAGTTA TGTGTTGACA TCCAGATCAC ATTTGATTGA CAGTCCACCA ACTTACAATG 360 

CTGACTATGG CTACAAAAGC TGGGAAGCCT TCTCCAACCT CTCCTATTAT ACTAGAGCCC 420 

TTCCTCCTGT GCCTGATGAT TGCCCGACTC CCTTGGGTGT CAAAGGTAAA AAGCAGCTTC 480 

CTGATTCAAA TGAGATTGTG GAAAAATTGC TTCTAAGAAG AAAGTTCATC CCTGATCCCC 540 

AGGGCTCAAA CATGATGTTT GCATTCTTTG CCCAGCACTT CACGCATCAG TTTTTCAAGA 600 

CAGATCATAA GCGAGGGCCA GCTTTCACCA ACGGGCTGGG CCATGGGGTG GACTTAAATC 660 

ATATTTACGG TGAAACTCTG GCTAGACAGC GTAAACTGCG CCTTTTCAAG GATGGAAAAA " 720 

TGAAATATCA GATAATTGAT GGAGAGATGT ATCCTCCCAC AGTCAAAGAT ACTCAGGCAG 780 

AGATGATCTA CCCTCCTCAA GTCCCTGAGC ATCTACGGTT TGCTGTGGGG CAGGAGGTCT 840 

TTGGTCTGGT GCCTGGTCTG ATGATGTATG CCACAATCTG GCTGCGGGAA CACAACAGAG 900 

TATGCGATGT GCTTAAACAG GAGCATCCTG AATGGGGTGA TGAGCAGTTG TTCCAGACAA 960 

GCAGGCTAAT ACTGATAGGA GAGACTATTA AGATTGTGAT TGAAGATTAT GTGCAACACT 1020 

TGAGTGGCTA TCACTTCAAA CTGAAGTTTG ACCCAGAACT ACTTTTCAAC AAACAGTTCC 1080 

AGTACCAAAA TCGTATTGCT GCTGAATTTA ACACCCTCTA TCACTGGCAT CCCCTTCTGC 1140 

CTGACACCTT TCAAATTCAT GACCAGAAAT ACAACTATCA ACAGTTTATC TACAACAACT 1200 

CTATATTGCT GGAACATGGA ATTACCCAGT TTGTTGAATC ATTCACCAGG CAGATTGCTG 1260 

GCAGGGTTGC TGGTGGTAGG AATGTTCCAC . CCGCAGTACA GAAAGTATCA CAGGCTTCCA 1320 



- 84 



WO 96/40720 



PCT/US96/08311 



TTGACCAGAG 


CAGGCAGATG 


AAATACCAGT 


CTTTTAATGA 


GTACCGCAAA 


CGCTTTATGC 


1380 


TGAAGCCCTA 


TGAATCATTT 


GAAGAACTTA 


CAGGAGAAAA 


GGAAATGTCT 


GCAGAGTTGG 


1440 


AAGCACTCTA 


TGGTGACATC 


GATGCTGTGG 


AGCTGTATCC 


TGCCCTTCTG 


GTAGAAAAGC 


1500 


CTCGGCCAGA 


TGCCATCTTT 


CCTCAAACCA 


TCCTACAACT 


TGGAGCACCA 


TTCTCCTTGA 


1560 


AACCACTTAT 


GGGTAATGTT 


ATATGTTCTC 


CTGCCTACTG 


GAAGCCAAGC 


ACTTTTGGTG 


1620 


GAGAAGTGGG 


TTTTCAAATC 


ATCAACACTG 


CCTCAATTCA 


GTCTCTCATC 


TGCAATAACG 


1680 


TGAAGGGCTG 


TCCCTTTACT 


TCATTCAGTG 


TTCCAGATCC 


AGAGCTCATT 


AAAACAGTCA 


1740 


CCATCAATGC 


AAGTTCTTCC 


CGCTCCGGAC 


TAGATGATAT 


CAATCCCACA 


CTACTACTAA 


1800 


AAGAACGTTC 


GACTGAACTG 


TAGAAGTCTA 


ATAC 






1834 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Ala Arg Ala Leu Leu Leu Cys Ala Val Leu Ala Leu Ser His 
15 10 15 

Thr Ala Asn Pro Cys Cys Ser His Pro Cys Gin Asn Arg Gly Val Cys 
20 25 30 

Met Ser Val Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 " 40 45 

Phe Tyr Gly Glu Asn Cys Ser Thr Pro Glu Phe Leu Thr Arg lie Lys 
50 55 60 

Leu Phe Leu Lys Pro Thr Pro Asn Thr Val His Tyr lie Leu Thr His 
65 70 75 80 

Phe Lys Gly Phe Trp Asn Val Val Asn Asn lie Pro Phe Leu Arg Asn 
85 90 95 

Ala He Met Ser Tyr Val Leu Thr Ser Arg Ser HiB Leu He Asp Ser 
100 105 110 

Pro Pro Thr Tyr Asn Ala Asp Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser Asn Leu Ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Pro Asp Asp 
130 135 140 

CyB Pro Thr Pro Leu Gly Val Lys Gly Lys Lys Gin Leu Pro Asp Ser 
145 150 155 160 

Asn Glu He Val Glu Lys Leu Leu Leu Arg Arg Lys Phe He Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin His Phe Thr 
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TT6ACCA6AG CAGGCAGATG 


AAATACCAGT 


CTTTTAATGA 


GTACCGCAAA 


CGCTTTATGC 


1380 


TGAAGCCCTA TGAATCATTT 


GAAGAACTTA 


CAGGAGAAAA 


GGAAATGTCT 


GCAGAGTTGG 


1440 


AAGCACTCTA TGGTGACATC 


GATGCTGTGG 


AGCTGTATCC 


TGCCCTTCTG 


GTAGAAAAGC 


1500 


CTCGGCCAGA TGCCATCTTT 


CCTCAAACCA 


TCCTACAACT 


TGGAGCACCA 


TTCTCCTTGA 


1560 


AACCACTTAT GGGTAATGTT 


ATATGTTCTC 


CTGCCTACTG 


GAAGCCAAGC 


ACTTTTGGTG 


1620 


GAGAAGTGGG TTTTCAAATC 


ATCAACACTG 


CCTCAATTCA 


GTCTCTCATC 


TGCAATAACG 


1680 


TGAAGGGCTG TCCCTTTACT 


TCATTCAGTG 


TTCCAGATCC 


AGAGCTCATT 


AAAACAGTCA 


1740 


CGATCAATGC AAGTTCTTCC 


CGCTCCGGAC 


TAGATGATAT 


CAATCCCACA 


CTACTACTAA 


1800 


AAGAACGTTC GACTGAACTG 


TAGAAGTCTA 


ATAC 






1834 


(2) INFORMATION FOR SEQ ID NO: 4: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Ala Arg Ala Leu Leu Leu Cys Ala Val Leu Ala Leu Ser His 
15 10 15 

Thr Ala Asn Pro Cys Cys Ser His Pro Cys Gin Asn Arg Gly Val Cys 
20 25 30 

Met Ser Val Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 40 45 

Phe Tyr Gly Glu Asn Cys Ser Thr Pro Glu Phe Leu Thr Arg lie Lys 
50 55 60 

Leu Phe Leu Lys Pro Thr Pro Asn Thr Val His Tyr lie Leu Thr His 
65 70 75 80 

Phe Lys Gly Phe Trp Asn Val Val Asn Asn lie Pro Phe Leu Arg Asn 
85 90 95 

Ala lie Met Ser Tyr Val Leu Thr Ser Arg Ser His Leu lie Asp Ser 
100 105 110 

Pro Pro Thr Tyr Asn Ala Asp Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser Asn Leu Ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Pro Asp Asp 
130 135 140 

Cys Pro Thr Pro Leu Gly Val Lys Gly Lys Lys Gin Leu Pro Asp Ser 
145 150 155 160 

Asn Glu lie Val Glu Lys Leu Leu Leu Arg Arg Lys Phe lie Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin His Phe Thr 
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180 185 190 

Hia Gin Phe Phe Lye Thr Aap Hie Lya Arg Gly Pro Ala Phe Thr Asn 
195 200 205 

Gly Leu Gly His Gly Val Asp Leu Asn Hia lie, Tyr Gly Glu Thr Leu 
210 215 220 

Ala Arg Gin Arg Lya Leu Arg Leu Phe Lya Aap Gly Lya Met Lya Tyr 
225 230 235 240 

Gin lie lie Aap Gly Glu Met Tyr Pro Pro Thr Val Lys Aap Thr Gin 
245 250 255 

Ala Glu Met lie Tyr Pro Pro Gin Val Pro Glu His Leu Arg Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr lie Trp Leu Arg Glu Hia Aan Arg Val Cy8 Asp Val Leu Lya Gin 
290 295 300 

Glu Hia Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

He Leu He Gly Glu Thr He Lys He Val He Glu Asp Tyr Val Gin 
325 330 335 

Hia Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Lya Gin Phe Gin Tyr Gin Aan Arg He Ala Ala Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Aap Thr Phe Gin He His 
370 375 380 

Asp Gin Lys Tyr Asn Tyr Gin Gin Phe He Tyr Asn Asn Ser He Leu 
385 * 390 395 400 

Leu Glu His Gly He Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg ABn Val Pro Pro Ala Val Gin Lys 
420 - 425 430 

Val Ser Gin Ala Ser He Aap Gin Ser Arg Gin Met Lys Tyr Gin Ser 
435 440 445 

Phe Asn Glu Tyr Arg Lys Arg Phe Met Leu Lys Pro Tyr Glu Ser Phe 
450 455 460 

Glu Glu Leu Thr Gly Glu Lys Glu Met Ser Ala Glu Leu Glu Ala Leu 
465 470 475 480 

Tyr Gly Asp He Asp Ala Val Glu Leu Tyr Pro Ala Leu Leu Val Glu 
485 490 495 

Lya Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Val Gly 
500 505 510 

Ala Pro Phe Ser Leu Lya Gly Leu Met Gly Aan Val He Cya Ser Pro 
515 520 525 

Ala Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Gin He 
530 535 540 
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180 



185 



190 



His Gin Phe Phe Lye Thr Asp His Lys Arg Gly Pro Ala Phe Thr Asn 
195 200 205 

Gly Leu Gly His Gly Val Asp Leu Asn His He Tyr Gly Glu Thr Leu 
210 215 220 

Ala Arg Gin Arg Lys Leu Arg Leu Phe Lys Asp Gly Lys Met Lys Tyr 
225 230 235 240 

Gin He He Asp Gly Glu Met Tyr Pro Pro Thr Val Lys Asp Thr Gin 
245 250 255 

Ala Glu Met He Tyr Pro Pro Gin Val Pro Glu His Leu Arg Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr He Trp Leu Arg Glu His Asn Arg Val Cys Asp Val Leu Lys Gin 
290 ~ 295 300 

Glu His Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

He Leu He Gly Glu Thr He Lys He Val He Glu Asp Tyr Val Gin 
325 330 335 

His Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Lys Gin Phe Gin Tyr Gin Asn Arg He Ala Ala Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Asp Thr Phe Gin He His 
370 " 375 380 

Asp Gin Lys Tyr Asn Tyr Gin Gin Phe He Tyr Asn Asn Ser He Leu 
385 ' ~ 390 395 400 

Leu Glu His Gly He Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg Asn Val Pro Pro Ala Val Gin Lys 
420 425 430 

Val Ser Gin Ala Ser He Asp Gin Ser Arg Gin Met Lys Tyr Gin Ser 
435 440 445 

Phe Asn Glu Tyr Arg Lys Arg Phe Met Leu Lys Pro Tyr Glu Ser Phe 
450 455 460 



Glu Glu Leu Thr Gly Glu Lys Glu Met Ser Ala Glu Leu Glu Ala Leu 
465 470 475 480 

Tyr Gly Asp He Asp Ala Val Glu Leu Tyr Pro Ala Leu Leu Val Glu 
485 490 495 

Lys Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Val Gly 
500 i 505 510 

Ala Pro Phe Ser Leu Lys Gly Leu Met Gly Asn Val He Cys Ser Pro 
515 520 525 

Ala Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Gin He 
530 535 540 
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lie Asn Thr Ala Ser lie Gin Ser Leu lie Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Ser Val Pro Asp Pro Glu Leu lie Lys Thr 
565 570 575 

Val Thr He Asn Ala Ser Ser Ser Arg Ser Gly Leu Asp Asp He Asn 
580 585 590 

Pro Thr Val Leu Leu Lys Glu Arg Ser Thr Glu Leu 
595 600 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Leu Ala Arg Ala Leu Leu Leu Cys Ala Val Leu Ala Leu Ser His 
1 5 10 15 

Thr Ala ABn Pro Cys Cys Ser His Pro Cys Gin Asn Arg Gly Val Cys 
20 " 25 30 

Met Ser Val Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 40 45 

Phe Tyr Gly Glu Asn Cys Ser Thr Pro Glu Phe Leu Thr Arg He Lys 
50 ^ 55 60 

Leu Phe Leu Lys Pro Thr Pro Asn Thr Val His Tyr He Leu Thr His 
65 70 75 80 

Phe Lys Gly Phe Trp Asn Val Val Asn Asn He Pro Phe Leu Arg Asn 
85 90 95 

Ala He Met Ser Tyr Val Leu Thr Ser Arg Ser His Leu He Asp Ser 
100 105 110 

Pro Pro Thr Tyr Asn Ala Asp Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser ABn Leu Ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Pro Asp Asp 
130 - 135 140 

Cys Pro Thr Pro Leu Gly Val Lys Gly Lys Lys Gin Leu Pro Asp Ser 
145 150 155 160 

Asn Glu He Val Gly Lys Leu Leu Leu Arg Arg Lye Phe He Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin His Phe Thr 
180 185 190 

His Gin Phe Phe Lys Thr Asp His Lys Arg Gly Pro Ala Phe Thr Asn 
195 " 200 205 

Gly Leu Gly His Gly Val Asp Leu Asn His He Tyr Gly Glu Thr Leu 
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lie Asn Thr Ala Ser lie Gin Ser Leu lie Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Ser Val Pro Asp Pro Glu Leu lie Lys Thr 
565 570 575 

Val Thr lie Asn Ala Ser Ser Ser Arg Ser Gly Leu Asp Asp lie Asn 
580 585 590 

Pro Thr Val Leu Leu Lys Glu Arg Ser Thr Glu Leu 
595 600 

(2) INFORMATION FOR SEQ ID NO: 5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Leu Ala Arg Ala Leu Leu Leu Cys Ala Val Leu Ala Leu Ser His 
1 5 10 15 

Thr Ala Asn Pro Cys Cys Ser His Pro Cys Gin Asn Arg Gly Val Cys 
20 25 30 

Met Ser Val Gly Phe Asp Gin Tyr Lys Cys Asp Cys Thr Arg Thr Gly 
35 40 45 

Phe Tyr Gly Glu Asn Cys Ser Thr Pro Glu Phe Leu Thr Arg lie Lys 
50 55 60 

Leu Phe Leu Lys Pro Thr Pro Asn Thr Val His Tyr lie Leu Thr His 
65 70 75 80 

Phe Lys Gly Phe Trp Asn Val Val Asn Asn He Pro Phe Leu Arg Asn 
85 90 95 

Ala He Met Ser Tyr Val Leu Thr Ser Arg Ser His Leu He Asp Ser 
100 105 110 

Pro Pro Thr Tyr Asn Ala Asp Tyr Gly Tyr Lys Ser Trp Glu Ala Phe 
115 120 125 

Ser ABn Leu ser Tyr Tyr Thr Arg Ala Leu Pro Pro Val Pro Asp Asp 
130 135 140 

Cys Pro Thr Pro Leu Gly Val Lys Gly Lys Lys Gin Leu Pro Asp Ser 
145 150 155 160 

Asn Glu He Val Gly Lys Leu Leu Leu Arg Arg Lys Phe He Pro Asp 
165 170 175 

Pro Gin Gly Ser Asn Met Met Phe Ala Phe Phe Ala Gin His Phe Thr 
180 185 190 

His Gin Phe Phe Lys Thr Asp His Lys Arg Gly Pro Ala Phe Thr Asn 
195 ^ 200 205 

Gly Leu Gly His Gly Val Asp Leu Asn Hia He Tyr Gly Glu Thr Leu 
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210 



215 



220 



Ala Arg Gin Arg Lys Leu Arg Leu Phe Lys Asp Gly Lys Met Lys Tyr 
225 230 235 240 

Gin He He Asp Gly Glu Met Tyr Pro Pro Thr Val Lys Asp Thr Gin 
245 250 255 

Ala Glu Met He Tyr Pro Pro Gin Val Pro Glu His Leu Arg Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr He Trp Leu Arg Glu His Asn Arg Val Cys Asp Val Leu Lys Gin 
290 295 300 

Glu His Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

He Leu He Gly Glu Thr He Lys He Val He Glu Asp Tyr Val Gin 
325 330 335 

His Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Lys Gin Phe Gin Tyr Gin Asn Arg He Ala Ala Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Asp Thr Phe Gin He His 
370 ~ 375 380 

Asp Gin Lys Tyr Asn Tyr Gin Gin Phe He Tyr Asn Asn Ser He Leu 
385 390 395 400 

Leu Glu His Gly He Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg Asn Val Pro Pro Ala Val Gin Lys 
420 425 430 

Val Ser Gin Ala Ser He Asp Gin Ser Arg Gin Met Lys Tyr Gin Ser 
435 440 445 

Phe Asn Glu Tyr Arg Lys Arg Phe Met Leu Lys Pro Tyr Glu Ser Phe 
450 " 455 460 

Glu Glu Leu Thr Gly Glu Lys Glu Met Ser Ala Glu Leu Glu Ala Leu 
465 470 475 480 

Tyr Gly Asp He Asp Ala Val Glu Leu Tyr Pro Ala Leu Leu Val Glu 
485 490 495 

Lys Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Val Gly 
500 505 510 

Ala Pro Phe Ser Leu Lys Gly Leu Met Gly Asn Val He Cys Ser Pro 
515 520 525 

Ala Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Gin He 
530 535 540 

He Asn Thr Ala Ser He Gin Ser Leu He Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Ser Val Pro Asp Pro Glu Leu He Lys Thr 



565 



570 



575 
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210 



215 



220 



Ala Arg Gin Arg Lys Leu Arg Leu Phe Lye Asp Gly Lys Met Lys Tyr 
225 230 235 240 

Gin lie lie Asp Gly Glu Met Tyr Pro Pro Thr Val Lys Asp Thr Gin 
245 250 255 

Ala Glu Met lie Tyr Pro Pro Gin Val Pro Glu His Leu Arg Phe Ala 
260 265 270 

Val Gly Gin Glu Val Phe Gly Leu Val Pro Gly Leu Met Met Tyr Ala 
275 280 285 

Thr lie Trp Leu Arg Glu His Asn Arg Val Cys Asp Val Leu Lys Gin 
290 ~ 295 300 

Glu His Pro Glu Trp Gly Asp Glu Gin Leu Phe Gin Thr Ser Arg Leu 
305 310 315 320 

lie Leu lie Gly Glu Thr lie Lys lie Val lie Glu Asp Tyr Val Gin 
325 330 335 

His Leu Ser Gly Tyr His Phe Lys Leu Lys Phe Asp Pro Glu Leu Leu 
340 345 350 

Phe Asn Lys Gin Phe Gin Tyr Gin ABn Arg lie Ala Ala Glu Phe Asn 
355 360 365 

Thr Leu Tyr His Trp His Pro Leu Leu Pro Asp Thr Phe Gin He His 
370 375 380 

Asp Gin Lys Tyr Asn Tyr Gin Gin Phe He Tyr Asn Asn Ser He Leu 
385 390 395 400 

Leu Glu His Gly He Thr Gin Phe Val Glu Ser Phe Thr Arg Gin He 
405 410 415 

Ala Gly Arg Val Ala Gly Gly Arg Asn Val Pro Pro Ala Val Gin Lys 
420 425 430 

Val Ser Gin Ala Ser He Asp Gin Ser Arg Gin Met Lys Tyr Gin Ser 
435 440 445 

Phe Asn Glu Tyr Arg Lys Arg Phe Met Leu Lys Pro Tyr Glu Ser Phe 
450 455 460 

Glu Glu Leu Thr Gly Glu Lys Glu Met Ser Ala Glu Leu Glu Ala Leu 
465 470 475 480 

Tyr Gly Asp He Asp Ala Val Glu Leu Tyr Pro Ala Leu Leu Val Glu 
485 490 495 

Lys Pro Arg Pro Asp Ala He Phe Gly Glu Thr Met Val Glu Val Gly 
500 505 510 

Ala Pro Phe Ser Leu Lys Gly Leu Met Gly Asn Val He Cys Ser Pro 
515 520 525 

Ala Tyr Trp Lys Pro Ser Thr Phe Gly Gly Glu Val Gly Phe Gin He 
530 535 540 

He Asn Thr Ala Ser He Gin Ser Leu He Cys Asn Asn Val Lys Gly 
545 550 555 560 

Cys Pro Phe Thr Ser Phe Ser Val Pro Asp Pro Glu Leu He Lys Thr 
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Val Thr lie Asn Ala Ser Ser Ser Arg Ser Gly Leu Asp Asp lie Asn 
580 585 590 

Pro Thr Val Leu Leu Lys Glu Arg Ser Thr Glu Leu 
595 600 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CCGCGCCATG AGCCGGAGTC TCTTGCTCCG GTTCTTGCTG 


TTGCTGCTCC 


TGCTCCCGCC 


60 


GCTCCCCGTC CTGCTCGCGG ACCCAGGGGC GCCCACGCCA GTGAATCCCT GTTGTTACTA 


120 


TCCATGCCAG CACCAGGGCA TCTGTGTCCG CTTCGGCCTT 


GACCGCTACC 


AGTGTGACTG 


180 


CACCCGCACG GGCTATTCCG GCCCCAACTG CACCATCCCT 


GGCCTGTGGA 


CCTGGCTCCG 


240 


GAATTCACTG CGGCCCAGCC CCTCTTTCAC CCACTTCCTG 


CTCACTCACG 


GGCGCTGGTT 


300 


CTGGGAGTTT GTCAATGCCA CCTTCATCCG AGAGATGCTC 


ATGCTCCTGG 


TACTCACAGT 


360" 


GCGCTCCAAC CTTATCCCCA GTCCCCCCAC CTACAACTCT 


GCACATGACT 


ACATCAGCTG 


420 


GGAGTCTTTC TCCAACGTGA GCTATTACAC TCGTATTCTG CCCTCTGTGC 


CTAAAGATTG 


480 


CCCCACACCC ATGGGAACCA AAGGGAAGAA GCAGTTGCCA 


GATGCCCAGC 


TCCTGGCCCG 


540 


CCGCTTCCTG CTCAGGAGGA AGTTCATACC TGACCCCCAA 


GGCACCAACC 


TCATGTTTGC 


600 


CTTCTTTGCA CAACACTTCA CCCACCAGTT CTTCAAAACT 


TCTGGCAAGA 


TGGGTCCTGG 


660 


CTTCACCAAG GCCTTGGGCC ATGGGGTAGA CCTCGGCCAC 


ATTTATGGAG 


ACAATCTGGA 


720 


GCGTCAGTAT CAACTGCGGC TCTTTAAGGA TGGGAAACTC 


AAGTACCAGG 


TGCTGGATGG 


780 


AGAAATGTAC CCGCCCTCGG TAGAAGAGGC GCCTGTGTTG 


ATGCACTACC 


CCCGAGGCAT 


840 


CCCGCCCCAG AGCCAGATGG CTGTGGGCCA GGAGGTGTTT 


GGGCTGCTTC 


CTGGGCTCAT 


900 


GCTGTATGCC ACGCTCTGGC TACGTGAGCA CAACCGTGTG 


TGTGACCTGC 


TGAAGGCTGA 


960 


GCACCCCACC TGGGGCGATG AGCAGCTTTT CCAGACGACC 


CGCCTCATCC 


TCATAGGGGA 


1020 


GACCATCAAG ATTGTCATCG AGGAGTACGT GCAGCAGCTG 


AGTGGCTATT 


TCCTGCAGCT 


1080 


GAAATTTGAC CCAGAGCTGC TGTTCGGTGT CCAGTTCCAA 


TACCGCAACC 


GCATTGCCAC 


1140 


GGAGTTCAAC CATCTCTACC ACTGGCACCC CCTCATGCCT 


GACTCCTTCA AGGTGGGCTC 


1200 


CCAGGAGTAC AGCTACGAGC AGTTCTTGTT CAACACCTCC 


ATGTTGGTGG 


ACTATGGGGT 


1260 


TGAGGCCCTG GTGGATGCCT TCTCTCGCCA GATTGCTGGC 


CGGATCGGTG 


GGGGCAGGAA 


1320 


CATGGACCAC CACATCCTGC ATGTGGCTGT GGATGTCATC 


AGGGAGTCTC 


GGGAGATGCG 


1380 
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Val Thr lie Asn Ala Ser Ser Ser Arg Ser Gly Leu Asp Asp He Asn 
580 585 590 

Pro Thr Val Leu Leu Lys Glu Arg Ser Thr Glu Leu 
595 600 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CCGCGCCATG AGCCGGAGTC TCTTGCTCCG GTTCTTGCTG TTGCTGCTCC 


TGCTCCCGCC 


60 


GCTCCCCGTC CTGCTCGCGG ACCCAGGGGC GCCCACGCCA GTGAATCCCT GTTGTTACTA 


120 


TCCATGCCAG CACCAGGGCA TCTGTGTCCG CTTCGGCCTT GACCGCTACC 


AGTGTGACTG 


180 


CACCCGCACG GGCTATTCCG GCCCCAACTG CACCATCCCT GGCCTGTGGA 


CCTGGCTCCG 


240 


GAATTCACTG CGGCCCAGCC CCTCTTTCAC CCACTTCCTG CTCACTCACG 


GGCGCTGGTT 


300 


CTGGGAGTTT GTCAATGCCA CCTTCATCCG AGAGATGCTC ATGCTCCTGG 


TACTCACAGT 


360* 


GCGCTCCAAC CTTATCCCCA GTCCCCCCAC CTACAACTCT GCACATGACT 


ACATCAGCTG 


420 


GGAGTCTTTC TCCAACGTGA GCTATTACAC TCGTATTCTG CCCTCTGTGC 


CTAAAGATTG 




CCCCACACCC ATGGGAACCA AAGGGAAGAA GCAGTTGCCA GATGCCCAGC 


TCCTGGCCCG 


540 


CCGCTTCCTG CTCAGGAGGA AGTTCATACC TGACCCCCAA GGCACCAACC 


TCATGTTTGC 


600 


CTTCTTTGCA CAACACTTCA CCCACCAGTT CTTCAAAACT TCTGGCAAGA 


TGGGTCCTGG 


660 


CTTCACCAAG GCCTTGGGCC ATGGGGTAGA CCTCGGCCAC ATTTATGGAG 


ACAATCTGGA 


720 


GCGTCAGTAT CAACTGCGGC TCTTTAAGGA TGGGAAACTC AAGTACCAGG 


TGCTGGATGG 


780 


AGAAATGTAC CCGCCCTCGG TAGAAGAGGC GCCTGTGTTG ATGCACTACC 


CCCGAGGCAT 


840 


CCCGCCCCAG AGCCAGATGG CTGTGGGCCA GGAGGTGTTT GGGCTGCTTC 


CTGGGCTCAT 


900 


GCTGTATGCC ACGCTCTGGC TACGTGAGCA CAACCGTGTG TGTGACCTGC 


TGAAGGCTGA 


960 


GCACCCCACC TGGGGCGATG AGCAGCTTTT CCAGACGACC CGCCTCATCC 


TCATAGGGGA 


1020 


GACCATCAAG ATTGTCATCG AGGAGTACGT GCAGCAGCTG AGTGGCTATT 


TCCTGCAGCT 


1080 


GAAATTTGAC CCAGAGCTGC TGTTCGGTGT CCAGTTCCAA TACCGCAACC 


GCATTGCCAC 


1140 


GGAGTTCAAC CATCTCTACC ACTGGCACCC CCTCATGCCT GACTCCTTCA AGGTGGGCTC 


1200 


. CCAGGAGTAC AGCTACGAGC AGTTCTTGTT CAACACCTCC ATGTTGGTGG 


ACTATGGGGT 


1260 


TGAGGCCCTG GTGGATGCCT TCTCTCGCCA GATTGCTGGC CGGATCGGTG 


GGGGCAGGAA 


1320 


CATGGACCAC CACATCCTGC ATGTGGCTGT GGATGTCATC AGGGAGTCTC 


GGGAGATGCG 


1380 



- 89 - 



WO 96/40720 



PCT/US96/08311 



GCTGCAGCCC TTCAATGAGT 


ACCGCAAGAG 


GTTTGGCATG 


AAACCCTACA 


CCTCCTTCCA 


T A A n 

J.44U 


GGAGCTCGTA GGAGAGAAGG 


AGATGGCAGC 


AGAGTTGGAG 


GAATTGTATG 


GAGACATTGA 


lbUU 


TGCGTTGGAG TTCTACCCTG 


GACTGCTTCT 


TGAAAAGTGC 


CATCCAAACT 


CTATCTTTGG 


J-O DU 


GGAGAGTATG ATAGAGATTG 


GGGCTCCCTT TTCCCTCAAG GGTCTCCTAG GGAATCCCAT 


1620 


CTGTTCTCCG GAGTACTGGA 


AGCCGAGCAC 


ATTTGGCGGC 


GAGGTGGGCT 


TTAACATTGT 


1680 


CAAGACGGCC ACACTGAAGA 


AGCTGGTCTG 


CCTCAACACC 


AAGACCTGTC 


CCTACGTTTC 


1740 


CTTCCGTGTG CCGGATGCCA 


GTCAGGATGA 


TGGGCCTGCT 


GTGGAGCGAC 


CATCCACAGA 


1800 


GCTCTGAGGG GCAGGAAAG 










1819 


(2) INFORMATION FOR SEQ ID NO: 7: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Thr lie Trp Leu Arg Glu His Asn Arg Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Ala Leu Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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GCTGCAGCCC 


TTCAATGAGT 


ACCGCAAGAG 


GTTTGGCATG 


AAACCCTACA 


CCTCCTTCCA 


1440 


GGAGCTCGTA 


GGAGAGAAGG 


AGATGGCAGC 


AGAGTTGGAG 


GAATTGTATG 


GAGACATTGA 


1500 


TGCGTTGGAG 


TTCTACCCTG 


GACTGCTTCT 


TGAAAAGTGC 


CATCCAAACT 


CXATCTX TUb 


i c c r\ 


GGAGAGTATG 


ATAGAGATTG 


GGGCTCCCTT 


TTCCCTCAAG 


GGTCTCCTAG 


GGAATCCCAT 


1620 


CTGTTCTCCG 


GAGTACTGGA 


AGCCGAGCAC 


ATTTGGOGGC 


GAGGTGGGCT 


TTAACATTGT 


1680 


CAAGACGGCC 


ACACTGAAGA 


AGCTGGTCTG 


CCTCAACACC 


AAGACCTGTC 


CCTACGTTTC 


1740 


CTTCCGTGTG 


COGGATGCCA 


GTCAGGATGA 


TGGGCCTGCT 


GTGGAGCGAC 


CATCCACAGA 


1800 


GCTCTGAGGG 


GCAGGAAAG 










1819 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Thr lie Trp Leu Arg Glu His Asn Arg Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Ala Leu Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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Arg Gly Leu Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTACCCGAA GCTTGCGCCA TGAGCCGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTCGTTAGCG GCCGCTGCCC CTCAGAGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCATTCTAAG CTTCCGCTGC GATGCTCGC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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Arg Gly Leu Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTACCCGAA GCTTGCGCCA TGAGCCGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTCGTTAGCG GCCGCTGCCC CTCAGAGC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCATTCTAAG CTTCCGCTGC GATGCTCGC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 baBe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GATCATGCGG CCGCATTAGA CTTCTACAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCGCTGCGAT GCTCGCCCGC GCCCTGCTGC TGTGCGCGGT CCTGGCGCTC AGCCATACAG 
CAAATCCTTG CTGTTCCCAC CCATGTCAAA ACCCAGGTGT ATGTATGAGT GTGGGATTTG 
ACCAGTATAA GTGCGATTGT ACCCGGACAG GATTCTATGG AGAAAACTGC TCAACACCGG 
AATTTTTGAC AAGAATAAAA TTATTTCTGA AACCCACTCC AAACACAGTG CACTACATAC 
TTACCCACTT CAAGGGATTT TGGAACGTTG TGAATAACAT TCCCTTCCTT CGAAATGCAA 
TTATGAGTTA TGTGTTGACA TCCAGATCAC ATTTGATTGA CAGTCCACCA ACTTACAATG 
CTGACTATGG CTACAAAAGC TGGGAAGCCT TCTCCAACCT CTCCTATTAT ACTAGAGCCC 
TTCCTCCTGT GCCTGATGAT TGCCCGACTC CCTTGGGTGT CAAAGGTAAA AAGCAGCTTC 
CTGATTCAAA TGAGATTGTG GAAAAATTGC TTCTAAGAAG AAAGTTCATC CCTGATCCCC 
AGGGCTCAAA CATGATGTTT GCATTCTTTG CCCAGCACTT CACGCATCAG TTTTTCAAGA 
CAGATCATAA GCGAGGGCCA GCTTTCACCA ACGGGCTGGG CCATGGGGTG GACTTAAATC 
ATATTTACGG TGAAACTCTG GCTAGACAGC GTAAACTGCG CCTTTTCAAG GATGGAAAAA 
TGAAATATCA GATAATTGAT GGAGAGATGT ATCCTCCCAC AGTCAAAGAT ACTCAGGCAG 
AGATGATCTA CCCTCCTCAA GTCCCTGAGC ATCTACGGTT TGCTGTGGGG CAGGAGGTCT 
TTGGTCTGGT GCCTGGTCTG ATGATGTATG CCACAATCTG GCTGCGGGAA CACAACAGAG 
TATGCGATGT GCTTAAACAG GAGCATCCTG AATGGGGTGA TGAGCAGTTG TTCCAGACAA 
GCAGGCTAAT ACTGATAGGA GAGACTATTA AGATTGTGAT TGAAGATTAT GTGCAACACT 
TGAGTGGCTA TCACTTCAAA CTGAAGTTTG ACCCAGAACT ACTTTTCAAC AAACAGTTCC 
AGTACCAAAA TCGTATTGCT GCTGAATTTA ACACCCTCTA TCACTGGCAT CCCCTTCTGC 
CTGACACCTT TCAAATTCAT GACCAGAAAT ACAACTATCA ACAGTTTATC TACAACAACT 
CTATATTGCT GGAACATGGA ATTACCCAGT TTGTTGAATC ATTCACCAGG CAGATTGCTG 
GCAGGGTTGC TGGTGGTAGG AATGTTCCAC CCGCAGTACA GAAAGTATCA CAGGCTTCCA 
TTGACCAGAG CAGGCAGATG AAATACCAGT CTTTTAATGA GTACCGCAAA CGCTTTATGC 
TGAAGCCCTA TGAATCATTT GAAGAACTTA CAGGAGAAAA GGAAATGTCT GCAGAGTTGG 



60 
120 
180 
240 
300 
360 
420 " 

480 

540 

600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GATCATGCGG CCGCATTAGA CTTCTACAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCGCTGCGAT GCTCGCCCGC GCCCTGCTGC TGTGCGCGGT CCTGGCGCTC AGCCATACAG 60 

CAAATCCTTG CTGTTCCCAC CCATGTCAAA ACCCAGGTGT ATGTATGAGT GTGGGATTTG 120 

ACCAGTATAA GTGCGATTGT ACCCGGACAG GATTCTATGG AGAAAACTGC TCAACACCGG 180 

AATTTTTGAC AAGAATAAAA TTATTTCTGA AACCCACTCC AAACACAGTG CACTACATAC 240 

TTACCCACTT CAAGGGATTT TGGAACGTTG TGAATAACAT TCCCTTCCTT CGAAATGCAA 300 

TTATGAGTTA TGTGTTGACA TCCAGATCAC ATTTGATTGA CAGTCCACCA ACTTACAATG 360 

CTGACTATGG CTACAAAAGC TGGGAAGCCT TCTCCAACCT CTCCTATTAT ACTAGAGCCC 420 

TTCCTCCTGT GCCTGATGAT TGCCCGACTC CCTTGGGTGT CAAAGGTAAA AAGCAGCTTC 480 

CTGATTCAAA TGAGATTGTG GAAAAATTGC TTCTAAGAAG AAAGTTCATC CCTGATCCCC 540 

AGGGCTCAAA CATGATGTTT GCATTCTTTG CCCAGCACTT CACGCATCAG TTTTTCAAGA 600 

CAGATCATAA GCGAGGGCCA GCTTTCACCA ACGGGCTGGG CCATGGGGTG GACTTAAATC 660 

ATATTTACGG TGAAACTCTG GCTAGApAGC GTAAACTGCG CCTTTTCAAG GATGGAAAAA 720 

TGAAATATCA GATAATTGAT GGAGAGATGT ATCCTCCCAC AGTCAAAGAT ACTCAGGCAG 780 

AGATGATCTA CCCTCCTCAA GTCCCTGAGC ATCTACGGTT TGCTGTGGGG CAGGAGGTCT 840 

TTGGTCTGGT GCCTGGTCTG ATGATGTATG CCACAATCTG GCTGCGGGAA CACAACAGAG 900 

TATGCGATGT GCTTAAACAG GAGCATCCTG AATGGGGTGA TGAGCAGTTG TTCCAGACAA 960 

GCAGGCTAAT ACTGATAGGA GAGACTATTA AGATTGTGAT TGAAGATTAT GTGCAACACT 1020 

TGAGTGGCTA TCACTTCAAA CTGAAGTTTG ACCCAGAACT ACTTTTCAAC AAACAGTTCC 1080 

AGTACCAAAA TCGTATTGCT GCTGAATTTA ACACCCTCTA TCACTGGCAT CCCCTTCTGC 1140 

CTGACACCTT TCAAATTCAT GACCAGAAAT ACAACTATCA ACAGTTTATC TACAACAACT 1200 

CTATATTGCT GGAACATGGA ATTACCCAGT TTGTTGAATC ATTCACCAGG CAGATTGCTG 1260 

GCAGGGTTGC TGGTGGTAGG AATGTTCCAC CCGCAGTACA GAAAGTATCA CAGGCTTCCA 1320 

TTGACCAGAG CAGGCAGATG AAATACCAGT CTTTTAATGA GTACCGCAAA CGCTTTATGC 1380 

TGAAGCCCTA TGAATCATTT GAAGAACTTA CAGGAGAAAA GGAAATGTCT GCAGAGTTGG 1440 
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AAGCACTCTA TGGTGACATC GATGCTGTGG AGCTGTATCC TGCCCTTCTG GTAGAAAAGC 1500 

CTCGGCCAGA TGCCATCTTT CCTCAAACCA TCCTACAACT TGGAGCACCA TTCTCCTTGA 1560 

AACCACTTAT GGGTAATGTT ATATGTTCTC CTGCCTACTG GAAGCCAAGC ACTTTTGGTG 1620 

GAGAAGTGGG TTTTCAAATC ATCAACACTG CCTCAATTCA GTCTCTCATC TGCAATAACG 1680 

TGAAGGGCTG TCCCTTTACT TCATTCAGTG TTCCAGATCC AGAGCTCATT AAAACAGTCA 1740 

CCATCAATGC AAGTTCTTCC CGCTCCGGAC TAGATGATAT CAATCCCACA CTACTACTAA 1800 

AAGAACGTTC GACTGAACTG TAGAAGTCTA ATAC 1834 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTCGATCAAA CCTTTTTTTT ATGGTACACA ATAGTCACAG TACTTTTCCA TATAAAACAG 
GTTTAGTGGT CTTAATTTAG TTTGGCACAT TTAATACACT CCCATGACCA GCATCCCAAA 
TGTACCTATC CGTTTTATTT TATTGTCTCA GAATTGTCAG TTATTTAATA AATTATGTAA 
CTTTTTTCCT TATGCTCAGA TTTGCACTTC TTTCTAAAAC TCTGCCCATC CTTAAAGTCC 
CAGATTCTCC TTGAACTTTT TTTTTTGACT TTCCAAGTAC ATGGAACTCT TCACTCTATC 
CTGCTATATA AGGTGACAGA ATTTCCACTA TGGGATAGAT GGAGTTCAAT TCCTTTGAGT 
TTAAAATAAT CTAAATATAA TTATTCCTTA TGCCCTGTTT TTCCCTCACT TTTGTATCCA 
AATCTCTTTT CAGACAACAG AACAATTAAT GTCTGATAAG GAAGACAATG ATGATGATCA 
CTTCAAAATG AATTCAGGAT TGtAATGTAA AATTTTAGTA CTCTCTCACA GTATGGATTC 
TAACATGGCT TCTAACCCAA ACTAACATTA GTAGCTCTAA CTATAAACTT CAAATTTCAG 
TAGATGCAAC CTACTCCTTT AAAATGAAAC AGAAGATTGA AATTATTAAA TTATCAAAAA 
GAAAATGATC CACGCTCTTA GTTGAAATTT CATGTAAGAT TCCATGCAAT AAATAGGAGT 
GCCATAAATG GAATGATGAA ATATGACTAG AGGAGGAGAA AGGCTCCTAG ATGAGATGGG 
ATTTTAGGCA TCCGTGTCTC ATGAGGAATC AGTTGTGTCA CTAGGCAAAA CAGTAAAAAA 
AAAAACCTCC AAGTGAGTCT CTTATTTATT TTTTTCTTAT AAGACTTCTA CAAATTGAGG 
TACCTGGTGT AGTTTTATTT CAGGTTTTAT GCTGTCATTT TCCTGTAATG CTAAGGACTT 
AGGACATAAC TGAATTTTCT ATTTTCCACT TCTTTTCTGG TGTGTGTGTA TATATATATG 
TATATATACA CACACACATA TACATATATA TATTTTTTAG TATCTCACCC TCACATGCTC 
CTCCCTGAGC ACTACCCATG ATAGATGTTA AACAAAAGCA AAGATGAAAT TCCAACTGTC 
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AAGCACTCTA TGGTGACATC GATGCTGTGG AGCTGTATCC TGCCCTTCTG GTAGAAAAGC 1500 

CTCGGCCAGA TGCCATCTTT CCTCAAACCA TCCTACAACT TGGAGCACCA TTCTCCTTGA 1560 

AACCACTTAT GGGTAATGTT ATATGTTCTC CTGCCTACTG GAAGCCAAGC ACTTTTGGTG 1620 

GAGAAGTGGG TTTTCAAATC ATCAACACTG CCTCAATTCA GTCTCTCATC TGCAATAACG 1680 

TGAAGGGCTG TCCCTTTACT TCATTCAGTG TTCCAGATCC AGAGCTCATT AAAACAGTCA 1740 

CCATCAATGC AAGTTCTTCC CGCTCCGGAC TAGATGATAT CAATCCCACA CTACTACTAA 1800 

AAGAACGTTC GACTGAACTG TAGAAGTCTA ATAC 1834 
(2) INFORMATION FOR SEQ ID NO: 15 J 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTCGATCAAA CCTTTTTTTT ATGGTACACA ATAGTCACAG TACTTTTCCA TATAAAACAG 
GTTTAGTGGT CTTAATTTAG TTTGGCACAT TTAATACACT CCCATGACCA GCATCCCAAA 
TGTACCTATC CGTTTTATTT TATTGTCTCA GAATTGTCAG TTATTTAATA AATTATGTAA 
CTTTTTTCCT TATGCTCAGA TTTGCACTTC TTTCTAAAAC TCTGCCCATC CTTAAAGTCC 
CAGATTCTCC TTGAACTTTT TTTTTTGACT TTCCAAGTAC ATGGAACTCT TCACTCTATC 
CTGCTATATA AGGTGACAGA ATTTCCACTA TGGGATAGAT GGAGTTCAAT TCCTTTGAGT 
TTAAAATAAT CTAAATATAA TTATTCCTTA TGCCCTGTTT TTCCCTCACT TTTGTATCCA 
AATCTCTTTT CAGACAACAG AACAATTAAT GTCTGATAAG GAAGACAATG ATGATGATCA 
CTTCAAAATG AATTCAGGAT TGTAATGTAA AATTTTAGTA CTCTCTCACA GTATGGATTC 
TAACATGGCT TCTAACCCAA ACTAACATTA GTAGCTCTAA CTATAAACTT CAAATTTCAG 
TAGATGCAAC CTACTCCTTT AAAATGAAAC AGAAGATTGA AATTATTAAA TTATCAAAAA 
GAAAATGATC CACGCTCTTA GTTGAAATTT CATGTAAGAT TCCATGCAAT AAATAGGAGT 
GCCATAAATG GAATGATGAA ATATGACTAG AGGAGGAGAA AGGCTCCTAG ATGAGATGGG 
ATTTTAGGCA TCCGTGTCTC ATGAGGAATC AGTTGTGTCA CTAGGCAAAA CAGTAAAAAA 
AAAAACCTCC AAGTGAGTCT CTTATTTATT TTTTTCTTAT AAGACTTCTA CAAATTGAGG 
TACCTGGTGT AGTTTTATTT CAGGTTTTAT GCTGTCATTT TCCTGTAATG CTAAGGACTT 
AGGACATAAC TGAATTTTCT ATTTTCCACT TCTTTTCTGG TGTGTGTGTA TATATATATG 
TATATATACA CACACACATA TACATATATA TATTTTTTAG TATCTCACCC TCACATGCTC 
CTCCCTGAGC ACTACCCATG ATAGATGTTA AACAAAAGCA AAGATGAAAT TCCAACTGTC 
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AAAATCCCCC CTCCATCTAA TTAATCCCTC ACCCAACTAT GTTCCAAAAC GAGAATAGAA 1200 

AATTAGCCCC AATAAGCCCA GGCAACTGAA AAGTAAATGC TATGTTGTAC TTTGATCCAT 1260 

GGTCACAACT CATAATCTTG GAAAAGTGGA CAGAAAAGAC AAAAGAGTGA ACTTTAAAAC 1320 

TCGAATTTAT TTTACCAGTA TCTCCTATGA AGGGCTAGTA ACCAAAATAA TCCACGCATC 1380 

AGGGAGAGAA ATGCCTTAAG GCATACGTTT TGGACATTTA GCGTCCCTGC AAATTCTGGC 1440 

CATCGCCGCT TCCTTTGTCC ATCAGAAGGC AGGAAACTTT ATATTGGTGA CCCGTGGAGC 1500 

TCACATTAAC TATTTACAGG GTAACTGCTT AGGACCAGTA TTATGAGGAG AATTTACCTT 1560 

TCCCGCCTCT CTTTCCAAGA AACAAGGAGG GGGTGAAGGT ACGGAGAACA GTATTTCTTC 1620 

TGTTGAAAGC AACTTAGCTA CAAAGATAAA TTACAGCTAT GTACACTGAA GGTAGCTATT 1680 

TCATTCCACA AAATAAGAGT TTTTTAAAAA GCTATGTATG TATGTGCTGC ATATAGAGCA 1740 

GATATACAGC CTATTAAGCG TCGTCACTAA AACATAAAAC ATGTCAGCCT TTCTTAACCT 1800 

TACTCGCCCC AGTCTGTCCC GACGTGACTT CCTCGACCCT CTAAAGACGT ACAGACCAGA 1860 

CACGGCGGCG GCGGCGGGAG AGGGGATTCC CTGCGGCCCC GGACCTCAGG GCCGCTCAGA 1920 

TTCCTGGAGA GGAAGCCAAG TGTCCTTCTG CCCTCCCCCG GTATCCCATC CAAGGCGATC 1980 

AGTCCACAAC TGGCTCTCGG AAGCACTCGG GCAAAGACTG CGAAGAAGAA AAGACATCTG 2040 

GCGGAAACCT GTGCGCCTGG GGCGGTGGAA CTCGGGGAGG AGAGGGAGGG ATCAGACAGG 2100 

AGAGTGGGGA CTACCCCCTC TGCTCCCAAA TTGGGGCAGC TTCCTGGGTT TCCGATTTTC 2160 

TCATTTGCGT GGGTAAAAAA CCCTGCCCCC ACCGGCTTAC GCAATTTTTT TAAGGGGAGA 2220 

GGAGGGAAAA ATTTGTGGGG GGTACGAAAA GGCGGAAAGA AACAGTCATT TCGTCACATG 2280 

GGCTTGGTTT TCAGTCTTAT AAAAAGGAAG GTTCTCTCGG TTAGCGACCA ATTGTCATAC 2340 

GACTTGCAGT GAGCGTCAGG AGCACGTCCA GGAACTCCTC AGCAGCGCCT CCTTCAGCTC 2400 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCCACCCGCA GTACAGAAAG TATCACAGGC T 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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AAAATCCCCC CTCCATCTAA TTAATCCCTC ACCCAACTAT GTTCCAAAAC GAGAATAGAA 1200 

AATTAGCCCC AATAAGCCCA GGCAACTGAA AAGTAAATGC TATGTTGTAC TTTGATCCAT 1260 

GGTCACAACT CATAATCTTG GAAAAGTGGA CAGAAAAGAC AAAAGAGTGA ACTTTAAAAC 1320 

TCGAATTTAT TTTACCAGTA TCTCCTATGA AGGGCTAGTA ACCAAAATAA TCCACGCATC 1380 

AGGGAGAGAA ATGCCTTAAG GCATACGTTT TGGACATTTA GCGTCCCTGC AAATTCTGGC 1440 

CATCGCCGCT TCCTTTGTCC ATCAGAAGGC AGGAAACTTT ATATTGGTGA CCCGTGGAGC 1500 

TCACATTAAC TATTTACAGG GTAACTGCTT AGGACCAGTA TTATGAGGAG AATTTACCTT 1560 

TCCCGCCTCT CTTTCCAAGA AACAAGGAGG GGGTGAAGGT ACGGAGAACA GTATTTCTTC 1620 

TGTTGAAAGC AACTTAGCTA CAAAGATAAA TTACAGCTAT GTACACTGAA GGTAGCTATT 1680 

TCATTCCACA AAATAAGAGT TTTTTAAAAA GCTATGTATG TATGTGCTGC ATATAGAGCA 1740 

GATATACAGC CTATTAAGCG TCGTCACTAA AACATAAAAC ATGTCAGCCT TTCTTAACCT 1800 

TACTCGCCCC AGTCTGTCCC GACGTGACTT CCTCGACCCT CTAAAGACGT ACAGACCAGA 1860 

CACGGCGGCG GCGGCGGGAG AGGGGATTCC CTGCGGCCCC GGACCTCAGG GCCGCTCAGA 1920 

TTCCTGGAGA GGAAGCCAAG TGTCCTTCTG CCCTCCCCCG GTATCCCATC CAAGGCGATC 1980 

AGTCCACAAC TGGCTCTCGG AAGCACTCGG GCAAAGACTG CGAAGAAGAA AAGACATCTG 2040 

GCGGAAACCT GTGCGCCTGG GGCGGTGGAA CTCGGGGAGG AGAGGGAGGG ATCAGACAGG 2100 

AGAGTGGGGA CTACCCCCTC TGCTCCCAAA TTGGGGCAGC TTCCTGGGTT TCCGATTTTC 2160 

TCATTTGCGT GGGTAAAAAA CCCTGCCCCC ACCGGCTTAC GCAATTTTTT TAAGGGGAGA 2220 

GGAGGGAAAA ATTTGTGGGG GGTACGAAAA GGCGGAAAGA AACAGTCATT TCGTCACATG 2280 

GGCTTGGTTT TCAGTCTTAT AAAAAGGAAG GTTCTCTCGG TTAGCGACCA ATTGTCATAC 2340 

GACTTGCAGT GAGCGTCAGG AGCACGTCCA GGAACTCCTC AGCAGCGCCT CCTTCAGCTC 2400 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCCACCCGCA GTACAGAAAG TATCACAGGC T 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



- 94 - 



WO 96/40720 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTGTTCCAGA TCCAGAGCTC ATTAAAACAG T 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg Xaa Xaa Xaa His 
1 5 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GTGTTCCAGA TCCAGAGCTC ATTAAAACAG T 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg Xaa Xaa Xaa His 
1 5 
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1. An isolated DNA molecule encoding human PGHS-2. 

5 2. The isolated DNA molecule of Claim 1 encoding the 

amino acid sequence of human PGHS-2 as shown in FIG. 7 (SEQ. 
ID NO. 4) . 

3. An isolated DNA molecule that hybridizes under 

10 highly stringent conditions to the complement a DNA sequence 
encoding amino acid residues 426-436 or 567-577 of the human 
PGHS-2 amino acid sequence shown in FIG. 7 (SEQ. ID NO. 4) . 

4. A recombinant DNA vector containing the DNA 
15 sequence of Claim l f 2 or 3. 

5. A recombinant DNA vector containing the DNA 
sequence of Claim 1, 2, or 3 operatively associated with a 
regulatory sequence that controls gene expression in a host. 

20 

6. A genetically engineered host cell that contains 
the DNA of Claim 1, 2 or 3. 

7. A genetically engineered host cell that contains a 
25 sequence encoding mammalian PGHS-2 operatively associated 

with a regulatory sequence that controls gene expression, so 
that a PGHS-2 gene product is stably expressed by the host 
cell. 

30 8. The genetically engineered host cell of Claim 7 in 

which the mammalian PGHS-2 gene product is the human PGHS-2 
gene product. 

9. The genetically engineered host cell of Claim 8 in 
35 which the human PGHS-2 gene product has the amino acid 
sequence shown in FIG. 7 (SEQ. ID. NO. 4) . 
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1. An isolated DNA molecule encoding human PGHS-2. 

5 2. The isolated DNA molecule of Claim 1 encoding the 

amino acid sequence of human PGHS-2 as shown in FIG. 7 (SEQ. 
ID NO. 4) . 

3. An isolated DNA molecule that hybridizes under 

10 highly stringent conditions to the complement a DNA sequence 
encoding amino acid residues 426-436 or 567-577 of the human 
PGHS-2 amino acid sequence shown in FIG. 7 (SEQ. ID NO. 4) . 

4. A recombinant DNA vector containing the DNA 
15 sequence of Claim 1, 2 or 3. 

5. A recombinant DNA vector containing the DNA 
sequence of Claim 1, 2, or 3 operatively associated with a 
regulatory sequence that controls gene expression in a host. 

20 

6. A genetically engineered host cell that contains 
the DNA of Claim 1, 2 or 3. 

7. A genetically engineered host cell that contains a 
25 sequence encoding mammalian PGHS-2 operatively associated 

with a regulatory sequence that controls gene expression, so 
that a PGHS-2 gene product is stably expressed by the host 
cell. 

30 8. The genetically engineered host cell of Claim 7 in 

which the mammalian PGHS-2 gene product is the human PGHS-2 
gene product. 

9. The genetically engineered host cell of Claim 8 in 
35 which the human PGHS-2 gene product has the amino acid 
sequence shown in FIG. 7 (SEQ. ID. NO. 4) . 
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17. The genetically engineered host cell of Claim 7 , 8, 
10 , or 12 in which the PGHS-2 DNA is stably integrated into 
the host cell chromosome. 

5 18. The genetically engineered host cell of Claim 17 in 

which the host cell is a mammalian cell that does not express 
autologous PGHS-1. 



19. The genetically engineered host cell designated 
10 hPGHS-2 A2.7 p6 11/7/93 as deposited with the ATCC having 

accession no. , or progeny thereof. 

20. A method for producing mammalian PGHS-2, 
comprising: 

15 (a) culturing a genetically engineered host cell 

that contains a nucleotide sequence encoding 
mammalian PGHS-2 operatively associated with a 
heterologous regulatory sequence that controls 
gene expression, so that mammalian PGHS-2 is 

20 stably overexpressed by the host cell; and 

(b) recovering the mammalian PGHS-2 gene product 
from the cell culture. 

21. The method of Claim 20 in which the mammalian PGHS- 
25 2 gene product is the human PGHS-2 gene product. 

22. The method of Claim 21 in which the human PGHS-2 
gene product has the amino acid sequence shown in FIG. 7 
(SEQ. ID. NO. 4) . 

23. The method of Claim 20 in which the mammalian PGHS- 
2 gene product is the murine PGHS-2 gene product. 

24. The method of Claim 23 in which the murine PGHS-2 
35 gene product has the amino acid sequence shown in FIG. 1 

(SEQ. ID. NO. 2) . 
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17. The genetically engineered host cell of Claim 7, 8, 
10, or 12 in which the PGHS-2 DNA is stably integrated into 
the host cell chromosome. 

5 18. The genetically engineered host cell of Claim 17 in 

which the host cell is a mammalian cell that does not express 
autologous PGHS-1. 

19. The genetically engineered host cell designated 
10 hPGHS-2 A2.7 p6 11/7/93 as deposited with the ATCC having 

accession no. , or progeny thereof. 

20. A method for producing mammalian PGHS-2, 
comprising: 

15 (a) culturing a genetically engineered host cell 

that contains a nucleotide sequence encoding 
mammalian PGHS-2 operatively associated with a 
heterologous regulatory sequence that controls 
gene expression, so that mammalian PGHS-2 is 

20 stably overexpressed by the host cell; and 

(b) recovering the mammalian PGHS-2 gene product 
from the cell culture. 

21. The method of Claim 20 in which the mammalian PGHS- 
25 2 gene product is the human PGHS-2 gene product. 

22. The method of Claim 21 in which the human PGHS-2 
gene product has the amino acid sequence shown in FIG. 7 
(SEQ. ID. NO. 4) . 

30 

23. The method of Claim 20 in which the mammalian PGHS- 
2 gene product is the murine PGHS-2 gene product. 



24. The method of Claim 23 in which the murine PGHS-2 
35 gene product has the amino acid sequence shown in FIG. 1 
(SEQ. ID. NO. 2) . 



WO 96/40720 



PCT/US96/08311 



25. The method of Claim 20 in which the DNA encoding 
the mammalian PGHS-2 gene product is; 

(a) a DNA sequence that hybridizes under stringent 
conditions to the complement a DNA sequence 

5 encoding the amino acid sequence of human 

PGHS-2 shown in FIG. 7 (SEQ. ID. NO. 4); or 

(b) a DNA sequence that hybridizes under stringent 
conditions to a DNA sequence encoding the 
amino acid sequence of murine PGHS-2 shown in 

10 FIG. 1 (SEQ. ID. NO. 2). 

26. The method according to Claim 20, 23 , or 25 in 
which the genetically engineered host cell is a mammalian 
host cell that does not express autologous PGHS-2. 

15 

27. The method according to Claim 20, 23, or 25 in 
which the PGHS-2 DNA is stably integrated into the host cell 
chromosome . 



20 28. The method according to Claim 20 in which the 

genetically engineered host cell is designated hPGHS-2 A2.7 
p6 11/7/93 as deposited with the ATCC having accession no. 
, or progeny thereof. 

25 29. A recombinant DNA molecule encoding a fusion 

protein comprising the mammalian PGHS-2 gene product, or a 
peptide fragment thereof, linked to a peptide or protein. 

30. The recombinant DNA molecule of Claim 29 in which 
30 the PGHS-2 gene product is the human PGHS-2 gene product. 

31. The recombinant DNA molecule of Claim. 29 in which 
the PGHS-2 gene product is the murine PGHS-2 gene product. 

35 32. A recombinant DNA vector containing the DNA 

sequence of Claim 29, 30 or 31. 
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25. The method of Claim 20 in which the DNA encoding 
the mammalian PGHS-2 gene product is: 

(a) a DNA sequence that hybridizes under stringent 
conditions to the complement a DNA sequence 

5 encoding the amino acid sequence of human 

PGHS-2 shown in FIG. 7 (SEQ.* ID. NO. 4); or 

(b) a DNA sequence that hybridizes under stringent 
conditions to a DNA sequence encoding the 
amino acid sequence of murine PGHS-2 shown in 

10 FIG. 1 (SEQ. ID. NO. 2). 

26. The method according to Claim 20 , 23, or 25 in 
which the genetically engineered host cell is a mammalian 
host cell that does not express autologous PGHS-2. 



15 



27. The method according to Claim 20, 23, or 25 in 
which the PGHS-2 DNA is stably integrated into the host cell 
chromosome . 



20 28. The method according to Claim 20 in which the 

genetically engineered host cell is designated hPGHS-2 A2.7 
p6 11/7/93 as deposited with the ATCC having accession no. 
, or progeny thereof. 

25 29. A recombinant DNA molecule encoding a fusion 

protein comprising the mammalian PGHS-2 gene product, or a 
peptide fragment thereof, linked to a peptide or protein. 

30. The recombinant DNA molecule of Claim 29 in which 
30 the PGHS-2 gene product is the human PGHS-2 gene product. 

31. The recombinant DNA molecule of Claim. 29 in which 
the PGHS-2 gene product is the murine PGHS-2 gene product. 

35 32. A recombinant DNA vector containing the DNA 

sequence of Claim 29, 30 or 31. 
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33. A recombinant DNA vector containing the DNA 
sequence of Claim 29, 30 or 31 operatively associated with a 
regulatory sequence that controls gene expression in a host. 

5 34. A genetically engineered host cell that contains 

the DNA sequence of Claim 29 , 30 or 31 operatively associated 
with a regulatory sequence that controls gene expression so 
that a PGHS-2 fusion protein is expressed by the host cell. 

10 35. A method for identifying a compound that inhibits 

prostaglandin synthesis catalyzed by mammalian PGHS-2 
comprising: 

(a) contacting the genetically engineered cell of 
Claim 7, with the compound in the presence of 

15 a pre-determined amount of arachidonic acid; 

(b) measuring the conversion of the arachidonic 
acid to its prostaglandin metabolite; and 

(c) comparing the amount of arachidonic acid 
converted by the cells exposed to the test 

20 compound to the amount of arachidonic acid 

converted by control cells that were not 
exposed to the test compound. 

36. The method of Claim 35 in which the genetically 
25 engineered cell is the host cell of Claim 17. 

37. The method of Claim 35 in which the genetically 
engineered cell is the host cell of Claim 18. 

30 38. The method of Claim 35 in which the genetically 

engineered host cell is designated hPGHS-2 A2.7 p6 11/7/93 as 
deposited with the ATCC having accession no. ... , or progeny 
thereof . 

35 39. A method for identifying a compound that inhibits 

prostaglandin synthesis catalyzed by mammalian PGHS-2, but 
does not inhibit the actuity of PGHS-l, comprising: 
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33. A recombinant DNA vector containing the DNA 
sequence of Claim 29, 30 or 31 operatively associated with a 
regulatory sequence that controls gene expression in a host. 

5 34. A genetically engineered host cell that contains 

the DNA sequence of Claim 29, 30 or 31 operatively associated 
with a regulatory sequence that controls gene expression so 
that a PGHS-2 fusion protein is expressed by the host cell. 

10 35. A method for identifying a compound that inhibits 

prostaglandin synthesis catalyzed by mammalian PGHS-2 
comprising: 

(a) contacting the genetically engineered cell of 
Claim 7, with the compound in the presence of 

15 a pre-determined amount of arachidonic acid; 

(b) measuring the conversion of the arachidonic 
acid to its prostaglandin metabolite; and 

(c) comparing the amount of arachidonic acid 
converted by the cells exposed to the test 

20 compound to the amount of arachidonic acid 

converted by control cells that were not 
exposed to the test compound. 

36. The method of Claim 35 in which the genetically 
25 engineered cell is the host cell of Claim 17. 

37. The method of Claim 35 in which the genetically 
engineered cell is the host cell of Claim 18. 

30 38. The method of Claim 35 in which the genetically 

engineered host cell is designated hPGHS-2 A2.7 p6 11/7/93 as 
deposited with the ATCC having accession no. ... , or progeny 
thereof . 

35 39. A method for identifying a compound that inhibits 

prostaglandin synthesis catalyzed by mammalian PGHS-2, but 
does not inhibit the actuity of PGHS-1, comprising: 
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(a) contacting a genetically engineered cell that 

expresses mammalian PGHS-2, and not mammalian PGHS- 

1, with the compound in the presence of a pre- 
determined amount of arachidonic acid; 

5 (b) contacting a genetically engineered cell that 

expresses mammalian PGHS-1, and not mammalian PGHS- 

2, with the compound in the presence of a 
predetermined amount of arachidonic acid; 

(c) measuring the conversion of arachidonic acid to its 
10 prostaglandin metabolite; and 

(d) comparing the amount of arachidonic acid converted 
by each cell exposed to the test compound to the 
amount of arachidonic acid converted by control 
cells that were not exposed to the test compound, 

15 so that compounds that inhibit PGHS-2 and not PGHS- 

1 activity are identified. 

40. The method of Claim 39 in which the PGHS-2 
expressing cell line is designated hPGHS-2 A2.7 p6 11/7/93 as 

20 deposited with the ATCC having accession no. , or 

progeny thereof. 

41. The method of Claim 39 in which the PGHS-1 
expressing cell line is designated A1.2 p5 2/20/95 as 

25 deposited with the ATCC having accession no. , or 

progeny thereof. 

42. A method for inhibiting prostaglandin synthesis in 
a mammalian host, comprising administering a compound that 

30 inhibits the expression or activity of the PGHS-2 gene 
product to a patient in need of such treatment. 

43. The method of Claim 42 in which the compound is an 
antisense or ribozyme molecule that blocks translation of the 

35 PGHS-2 gene product. 
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(a) contacting a genetically engineered cell that 

expresses mammalian PGHS-2, and not mammalian PGHS- 

1, with the compound in the presence of a pre- 
determined amount of arachidonic acid; 

5 (b) contacting a genetically engineered cell that 

expresses mammalian PGHS-1, and not mammalian PGHS- 

2, with the compound in the presence of a 
predetermined amount of arachidonic acid; 

(c) measuring the conversion of arachidonic acid to its 
10 prostaglandin metabolite; and 

(d) comparing the amount of arachidonic acid converted 
by each cell exposed to the test compound to the 
amount of arachidonic acid converted by control 
cells that were not exposed to the test compound, 

15 so that compounds that inhibit PGHS-2 and not PGHS- 

1 activity are identified. 

40. The method of Claim 39 in which the PGHS-2 
expressing cell line is designated hPGHS-2 A2.7 p6 11/7/93 as 

20 deposited with the ATCC having accession no. , or 

progeny thereof. 

41. The method of Claim 39 in which the PGHS-1 
expressing cell line is designated A1.2 p5 2/20/95 as 

25 deposited with the ATCC having accession no. , or 

progeny thereof. 

42. A method for inhibiting prostaglandin synthesis in 
a mammalian host, comprising administering a compound that 

30 inhibits the expression or activity of the PGHS-2 gene 
product to a patient in need of such treatment. 

43. The method of Claim 42 in which the compound is an 
antisense or ribozyme molecule that blocks translation of the 

35 PGHS-2 gene product. 



- 101 



WO 96/40720 



PCT/US96/08311 



44. The method of Claim 42 in which the compound is a 
DNA molecule complementary to the 5' region native to the 
PGHS-2 gene so that a triple helix is formed and 
transcription of the PGHS-2 gene is inhibited or prevented. 

5 

45. The method of Claim 42 in which the compound 
inhibits the enzymatic activity of the PGHS-2 gene product, 
and has minimal effect on enzymatic activity of PGHS-l. 

10 46. The method of Claim 42 which is used to treat 

inflammation. 

47. The method of Claim 42 which is used to treat 
arterial inflammation or pulmonary fibrosis. 

15 

48. The method of Claim 42 which is used to treat 
Alzheimer's disease, stroke or acute head injury. 

49. The method of Claim 42 which is used to treat 
20 endometriosis, dysmenorrhea, or pre-term labor. 

50. The method of Claim 42 which is used to treat 
cancers in which mammalian PGHS-2 is expressed or induced. 

25 51. The method of Claim 50 in which the cancer is 

prostate cancer, colorectal cancer, squamous cell carcinoma 
of the head or neck, breast cancer, oral pharyngeal cancer, 
stomach cancer, fibrosarcoma, skin cancer, or osteosarcoma. 

30 52. The method of Claim 42 which is used to treat 

radiation induced injury to the gastrointestinal tract, the 
brain, the lungs, hematopoietic tissues, or lymphocytes. 

53. A method for detecting the expression of PGHS-2 in 
35 a patient sample, comprising: 

(a) contacting a cell lysate or tissue section derived 
from the patient with a single-stranded nucleotide 
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prostate cancer, colorectal cancer, squamous cell carcinoma 
of the head or neck, breast cancer, oral pharyngeal cancer, 
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30 52. The method of Claim 42 which is used to treat 

radiation induced injury to the gastrointestinal tract, the 
brain, the lungs, hematopoietic tissues, or lymphocytes. 

53. A method for detecting the expression of PGHS-2 in 
35 a patient sample, comprising: 

(a) contacting a cell lysate or tissue section derived 
from the patient with a single-stranded nucleotide 
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sequence that is complementary to PGHS-2 mRNA under 
conditions which permit hybridization; 
(b) detecting whether hybridization of the single- 
stranded nucleotide sequence to the mRNA in the 
5 sample has occurred* 

54. A method for detecting the expression of PGHS-2 in 
a patient sample, comprising: 

(a) contacting a cell lysate or tissue section derived 
10 from the patient with an antibody that 

immunospecif ically binds to the PGHS-2 enzyme under 
conditions which permit antibody-antigen binding; 
and 

(b) detecting whether the antibody bound to the patient 
15 sample. 

55. The method of Claim 53 or 54 which is used to 
diagnose cancers in which expression of PGHS-2 is induced. 

20 56. The method of Claim 55 in which the cancer is 

prostate cancer, colorectal cancer, squamous cell carcinoma 
of the head or neck, breast cancer, oral pharyngeal cancer, 
stomach cancer, fibrosarcoma, skin cancer, or osteosarcoma. 

25 57. A genetically engineered host cell in which the 

native PGHS-2 gene sequence is disrupted so that expression 
of the native PGHS-2 gene product is inhibited or prevented. 
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FIG. I 

25 CnCAKAGTCACTCAGGACTCTGCTCACGAAGGAACTCAGCACTGCATCCTGCCAGwTC 64 

£5 CACCGCCACCACTACTGCCACCTCCGCTGCCACCTCTGCGATGCTCT7CCGAGCTGTGCT 144 

H L F R A V L 7 

145 GCTCTGCGCTGCCCTGGGGCTCAGCCAGGCAGCAAATCCTTGCTGTTCCAATCCATGTCA 204 
6 LCAALGLSO ATA RFCCSHFCQ 27 

205 AMCCGTGGGGAATCjTATGAGCACA^TrrGACCAGTATMCnCTGACTGTACCCGGAC 264 

2BHRGECMST6FD0YKCOCTRT 47 

265 TGGATTCTATGGTGAAAACTGTACTACACCTGAAraCTGACAAGAATCAAAn^ 324 

48 G F Y G E W C T TPEFLTRIKLiL 67 

325 GAAGCCCACCCCAAACXCXGTSCACTACATCCTGACCCACTTCM 3S4 

6B KPTPNTVHYILTHFKGVWNI 87 

385 TGTGAACAACATCCCCnCCKCGAAGmAATCATGAW 444 

86 VNNIPFLRSLIMKYVLTSRS 107 

445 ATATTTGATTGACAGTCCACCIACTTACAATGTGCACTATGGnACAAAAGCTGGGAAG 504 

108 Y L I D S P P T Y F V H Y G Y K S W E A 127 

505 CnCTCCAACCTCKCIACTACACCAGGGCCCrTCCT 564 

128 F s JU kJ j YYTRALPPVADDCPT 147 

565 TCCCATKSTiM^ 624 

146 PHGVKGRKELPDSKEVLEXV 167 
625 TOTCTACGGAGAGAGnCATCCCItt^ 6B4 
168 L L R R E F I PD P QGSHMMFAFF 187 
6B5 TGCCCAGCACTCCACCCATCAGTTOTCAAGACAGATCAIAAGC 744 
188 AOHFTHOFFKTDHICR G PG FT 207 
745 COGA^CT^GCCATGGAGTGGACTTAAAra 804 
208 R G h G H VDLBH1YGETLDRQ 227 
805 ACATAAACTGCGCCTTTTCAAGGATGGAAAATTGAAATATCAGCjTCA 864 
228 HKLRLFKDGKLKYQVIGGEV 247 
865 GTATCCCCCCACAGTCAAAGACACTCAGGTAGAGATGATCTACCCTCCTCACATCCCTGA 924 
246 YPPTVKDTQVEHIYPPHIPE 267 
925 GAACCTGCAGnTGCTGTGGGGCAGGAAGTCTTTGGTCTGGT^ 984 
26B KLOFAVGOEVFGLVPGLMMY 287 
965 TGCCACCATCrGGOTCGGGAGCACAACAGAGTGTGCGACATACTCAAGCAGG^ 1044 
286 ATIWLREHRRVCDILKQEHP 307 

1045 TGAGTGGGGTGATGAGCAACTATTCCAAACCAGCAGACTCATACTCATACk^GAGACTAT 1104 

308 EWGDEQLFQTSRLILIGETI 327 

1105 CAAGATAGTGATCGAA^CTACGTG^CACCTGAGCGGTTACCACTTCAAACTCMGTT 1164 

328 K 1 V I E D YVOHLSGYHFKLKF 347 

1165 TGACCCAGAGCTCCTTTTCMCCAGCAGTrCCAGTATCAGAACCGCATTGCCrCTGAAn 1224 

34E DPELLFNQQFQYQKRIASEF 367 

1225 CAACACACTCTATCACTGGCACwCCCTGCrGwCCGACACCTTCAACATTGAAGACCAGGA 1284 

36E NTLYHWHPlrLPDTFKI EDQE 367 

1285 GTACAGCnTAAACAGTTTCTCTACAACAACTCCATCCTCCTGGAACATGGACTCACTCA 1344 

388 YSFKOFLY V V S ILLEHGLTQ 407 

1345 CnrrOTGAGTCArrCACaGACACaTTG H04 

408 FVESFTRQIAGRVAGGRHVP 427 

1405 AATTGCTGTACAAGCAGTGGCAAAGGCCTCCAn^ 1464 

428 IAVJOAVAKASIDOSREMKYO 447 

1465 CTCTCTCAATuAGTAnCGGAAACGOTCTCCCTGAAGCCGTACACATCATrTGAAG^ 1524 

448 SLKEXRKRF5LKPYTSFEEL 467 

1525 TACAGC^GAGAAGGAAATGGCTGaGAATTGAMGCCCra 1584 

468 T G E K E MA. A ELKALYSDIDVK 487 

1585 GGAACTGTACCCTGCCCTGCTGGTGGAAAAACCTCGTCCAGWGCTATCOT 1644 

488 ELY PA LiVEKPRPDAIFGET 507 

1645 CATGCTAGAGCTTC^GCACCACTCTCCTTC^ 1704 

508 MVELGAPF ®l K G L KG N PICS 527 

1705 TCCTCAATACTGGAAGCCGAGCACCTTTGGAGGCGAAG 1764 

528 PQYWKPSTFGGEVGFKII NT 547 

1765 TGCCTCAATTCAGTCTCTCATCIGCAATMTGTGAAGGGGTt^CCOTCACTO 1824 

54B A5IQSLICRKVKGCPFTSFK 557 

1825 TGTGCAAGATCCACAGCCTACCAAAACAGCCACCATCAATGCAAGTGCCTCCCACTCCAG 1884 

56c VQO PQPTKTATI 'V A S A 5 K S R 5E~ 

1685 ACTAGATGACAnAACCCTACAGTACTAATCAAAAG(J^n^CTGAGCTGTAAAAGTC 1944 

58E L D D I £_? T.3LLLI:. R R S T E L 607 
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FIG. I 

25 CTTCACGAGTCAGTCAGGACTCTGCTCACGAAGGAACTCAGCACTGCATCCTGCCAGrTC E< 

65 CACCGCCACCACTACTGCCACCTCCGCTGCCACCTCTGCGATGCTCTTCCGAGCTGTuCT 144 

M L F R A V I 7 

145 GCTCTGCGCTGCCCTGGSGCTCAGCCAGSCAGCAMTCCTTGCTGTtCCAATCCATGTCA 204 

E LCAALGISOAUNPCCSNPCQ 27 

205 AAACCGTGGGGAATGTATGAGCACAGWTntlACCAGTATAAGTGTGACTGTACCCGGAC 264 

2B HRGECKSTGFDOYKCDCTRT 47 

265 TGGATrCTATGGTGAAAACTGTACTACACCTGAATnCTGACAAGAATCAAATTACTGCT 324 

IB G F Y G E M C T TPEFLTRIKILI 67 

325 GMGCCCACCCCAAACACAGTGCACTACATCCTGACCCACTTCAAGGGAC?rCTMAACAT 384 

6B KPTPRTVHYILTHFXGVHNI 87 

385 TGTGAACAACATCCCCrTCCTGCGAAGOTAATCATGAAATAIGTGCTGACATCCAGATC 444 

88 VKNI PFLRSLIMKYVLTSRS 107 

445 ATATTTGATTGACAGTCCACCTACirACAATGTGCACTATGGTTACAAAAGCTGGGAAGC 504 

108 YLIDSPPTYHVHYGYKSWEA 127 

505 CTTCTCCAACCTCTCCTACTACACCAGGGCCCTTCCTCCCGTAGCAGATGACTGCCCMC 564 

128 F s JU L. ? YYIRALPPVADDCPT 147 

565 TCCCATG&IGTGAAGGGAAATAAt^GOTCCTGATTCAAAAGAAGTGCTKAAMGCT 624 

148 PMGVXGNKELPDSKEVLEKV 167 

625 TCRCIACGGAGAGAGnCATCCCTGACCCCCAAGGCTCAAATATGATGTTTGCAnCTT 664 

168 L L R R £ F I PD P OGSNMMFAFF 187 

685 TGCCCAGCACTTCACCCATCAGTrrnCMGACAGATCAIAAGCGAGGACCTGGGnCAC 744 

188 AOHFTHQFFRTDHKRGPGFT 207 

745 C^GGACTGGGCCMGttGTGGAOTAAATCACArnATGGTGAAACT 804 

208 R G L G H -C VDLRHIYGETIDRO 227 

805 ACATAMCTGCGCCnTTCAAGGATGGAAAATTGAAATATCAGGTCATTGQTGGAGAGOT 864 

228 HKLRLFKDGKLKYQVIGGEV 247 

865 GTATCCCCCCACAGTCAAAGACACTCAGGTAGAGATGATCTACCCTCCrCACATCCCTGA 924 

24E YPPTVKDTOVEH1YPPHIPE 267 

925 GAACCTGCAGTTTGCTGTGGGGCAGGAAGTCTTIGGTCTGGTGCCTGGTCTGATGATGTA 984 

268 RLQFAVGOEVFGLVPGLHHY 287 

985 TGCCACCATCTGGCTTCGGGAGcACAACAGAGTGTGCGACATACTCAAGCAGGAGCAtCC 1044 

288 ATIWLREHNRVCDILKOEHP 307 

1045 TGAGTGG43G7GATGAGCAACTATTCCAMCCAGCAGACTCATACTCATAGGAGAGACTAT 1104 

308 EWGDEOIFOTSRLILIGETI 327 

1105 CAAGATAGTGATCGAAGACTACGTGCAACACCIGAGCGGTTACCACTTCAAACTCAAGT7 1164 

328 K 1 V I ED Y VOHLSGYHFKLKF 347 

1165 TGACCCAGAGCTCCTTTTCAACCAGcAGTTCCAGTATCAGAACCGCATTGCCTCrGAATT 1224 

34E D P £ L L F R 0 0 FQYQKRIASEF 367 

1225 CAACACACTCTATCACTGGCACCCCCrGCrGcCCGACACCTTCAACATTGAAGACCAGGA 12E< 

36E KTLYBHRPLLPDTFKI E D Q E 3E7 

12E5 GTACAGCITTAAAUGTTTCTCTACAACAACTCCATCCTCCIGGAACATGGACTCACTCA 1344 

388 YSFKOFiY B V S ILIEHGLTQ 407 

1345 GTTrGTTGAGTCATTCACCAGACAdSTTScTGGCCGGGTIGCTGGGGGM 1404 

408 FVESFTRQIAGRVAGGRRVP 427 

1405 AATTGCTGTACAAGCAGTGGCAAAGGCCTCCAtTGACCAGAGCAGAGAGATGAAATACCA 1464 

428 IAV_OAVAKASID OS R E M K Y Q 447 

1465 GTCTCTCAATGAGTAnCGGAMCGOTCTCCCTGMGcCGTACACATCATrTGAAGMCT 1524 

448 SLREXRXRFSLKPYTSFEEL 467 

1525 TACAGGAGAGAAGGAAATCGCTGCAGAATTGAAAGCCCTCIACAGTGACATCGATGTCAT 1584 

468 T G E K E MA AELKALYSDIDVK 487 

15B5 GGAACTGTACCCTGCCCTGCTGGTGGAAAAACCTCGTCCAGAIGCrATCnTGGGGAGAC 1644 

48£ ELYPAIIVEKPRPDA1FGET 507 

1645 CATGGTAGAGCntXSAGCACCATTOTCTTGAAAGGACnATGGGAAATCCCATCTGnC 1704 

508 « V E L G A P F_(f) L t G L B G H PICS 527 

1705 TCCTCAATACTGGAAGCCGAGCACCITTGGAGGCGAA GTGGGniTA AGATCAICAATA; 1764 

528 PQYWKPSTFGGEVGFKIIR? 547 

1765 TCCCTCMTTCAGTCTCTCATCIGCAMAATGTGAA GG GCTCrCCCnCACll C in'CAA 1824 

548 ASIQSLICRKVXGCPFTSFK 55" 

1825 TGTGCAAGATcCACAGCCrACCAAAACAGCCACCATCMTGCAAGTGCCTCCCACTCCAC 1884 

56E VQD PQPTKTATI •* A S A S K S K SET 

1885 ACTAGATGACATTAACCCTACAGTACTAATCAAMGGW^^CTGAGCTGTAAAAGTC 1944 
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FIG. 6A 



90 CCGCTGCGATGCTCGCCCGCGCCCTGCTGCTGTG^ 149 
CAAATCCnGCTOTtmCCC&T^^ 20g 
ACCMTATAAGIG0GAITGT«3^ 26g 

AWTTTTGACAAGMTAAAAm^ 329 

WACCCACTTOAGGGAT^^ 3fi9 

TTATGAGTTATGTGITGACATCCAGATCACA^nTG^ U3 

CTGACTATGGCTACAAAAGCTGGGAAGOCTrcTCCAACCICTCCM 5 09 

TTCCTCCTGTGCCTGATGATTGCCCGACTCCCTTGGGT^^ 569 

570 CTGATO^AATGAGATn^GGAAAAATTOT 629 

630 AGGGCTCAMCATGATGTTTGCATTCTTTCCG^ 689 

• • • # 

690 CACMCATAAGCtMGGCtt^ ?49 

' • . . 

750 ATATOACGGTGAMCTCTGGCTAGACAGCGTAAACTGCGOT 809 

* • • . 

810 TGAAAIATCAGATAATTOATGGAGAGATGTATCCICCC^^ 869 

870 AGATGATCTACCCTCOm^ g29 

930 noGtciosreccreGicT^^ 989 
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FIG. 6A 



90 CCGCTGCGAT^GCC^ ^ 

CAMTCCTTGCTGTTCCCACXrCATGTCAAA^ m 

ACCAGTATAAGTGCGATTGTACCCGGACAGGATTCTATCGAGAAW 269 

AATTTTTGAC^GMTAAMTTATITCTGAAACCCACl^^ 329 

TTACCCACTTCAAGGGATrTTGGAACGTTGIt^TAACA^^ 389 

n^CTATGGCTACAAAAGCTGCmtmTCTCm 509 

TOCTCCTGTCOT^T^TTGCCCG^ 559 

CR^ITCAAATGAGATTCTCGAAAAATTGCTICT 62 g 

AGGGCTCAAACATGATCTT^^ gg9 

690 CACIATC^TMGCGAGGGCCAGCTITCACCAACGGGCTG^ 749 

750 ATAriTACGGTGAMCTCTGOTAGACAGCCTAAACTGCG(^ 809 

810 TGAAATATCAGATAATOATGGAGAGATCTAIX^^ 869 

* • • # 

870 AGATGATCTACCCTCCTCAAGTCCCTGAGCAICTAC^^ S29 

930 TOGlCraGT^CC^^ ^ 

* * * • 

990 TATGCGATCKmAAAC^GGAGCATCCTGAA 1049 
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FIG. 6B 



1050 

1110 

1170 

1230 

1290 

1350 

1410 

1470 



(X^GGCTMTACTGATAGGAGAGACTATO UM 
TGMnra^TCACTTCAAAC^^ UM 

ACTACmMTCCTATTGCTGCTGAATimCACCCTC^ 1225 
CTGACACCTTTCAAATTCATGACCAGAAATACAACrATCA^ 1289 
CTATATTGCTGGAACATGGAATTAOOCAGTnGTI^ 134 g 
GCAGGGTTGCnXSTGCTAGGAATC 1409 

TIUACCAGAGCAGGCAGATGAAATACQGTCTTTTAATGACTAOC^^ 1469 

• • • m 

TGAAGCCCTATGAATCAm^(MCTTACAGGAGAAAA« 1529 

1530 MGCACTCTATGGTGACATCGATGCTGTGGAGCTGTATCCT^^ 1589 

1590 CTCG(mGATGCCATClTTGGTGAAACa 1Mg 

1650 AAGGACTTATGOTMTCrrATATGTTCTCCT^ 1709 

• • # 

1710 GaGAAGTGSaTTTCAM 17fi9 

1770 TGMGGGCTGICCCTTTACTTCATTCAGTC^^ 1829 

1830 CCMCWOO^OTCT^^ 1889 
1890 AAGAACGTTCGACTGAACTGTAGAAG1CTAATAC 1923 
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FIG. 6B 
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1170 
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1650 
1710 
1770 



GCAGOTAATACTGATAGGAGAGACTATTMG^^ UM 

TGAGTGGCTATC^CTTCAAACTGMGTTTGACCCAGAACTAOT 1169 

* • « 

AGTACCAAMTCGTATTGCTGCTGMTTIMC^CCCTCTATCA^ 1229 

CTGACACCTTTCAMTTCATGACCAGAAATACAACTATCAAW 1289 

CTATATTOCTGSAACATGGMmCC^ 134g 

GCAGGGTTGCIXJGTGGTAGGAATGTrcCACCCGCAGTACA 1409 

TTGACCAGAGCAGGCAGATGAMTACCAGTCTTITAATG^ 1469 

* • • 9 

TGAAGCCCTATGMTCATITGMGAAOTACAGQAGAAAAGGAAATGTC^^ 1 529 

• • . 

AAGCACTCTATGGTGACATCGATGCTGTSjAGCTCTAT^^ 1589 

1590 CTCGGCCACMGCCATCrTTQGTGAM 1649 

AAGGACTTATGGGTAATGTTATATGTiriCCTGOT 1709 

GAGAAtnxmTTTCAMTCATCMCACT^CCT^ 1769 

TGAAGGGCTGTCCCTTTACTTCATT^ 1829 

1830 CCATmTGCMGTTCTimXKrrCCGGACTAGA 1889 
1890 AAGAACGnCGACTGAACTGTAGAAGTCTAATAC 1923 
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FIG. 7 

hPGHS-2 
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FIG. 7 

hPGHS-2 MLARALLLCA VLALSHTANP CCSHPCQNRG VCHSVGFDQY KCDCTRTGFY 

si uum ua&i&i uamm ^ 
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